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Abstract 

A  firm  understanding  of  the  space  environment  is  necessary  to  defend  US  access  to 
space-based  systems.  Conventional  imaging  systems  have  been  developed  to  gather  infor¬ 
mation  on  space-based  objects,  but  they  are  incapable  of  imaging  objects  in  the  earth’s 
shadow.  In  order  close  this  gap  in  imaging-system  coverage,  an  active-illumination  based 
approach  must  be  used.  To  facilitate  this,  a  multi-frame  active  phase  diversity  imaging 
(APDI)  algorithm  is  derived  and  demonstrated  for  the  statistics  of  coherent  light.  In  ad¬ 
dition  to  conventional  focal-plane  and  diversity-plane  data,  a  statistical  description  for  the 
pupil-plane  intensity  distribution  is  formed  and  included  in  the  derivation.  The  algorithm 
is  implemented  and  characterized  using  a  Monte  Carlo  approach.  Analysis  shows  that  the 
algorithm  is  robust,  that  the  effect  of  system  configuration  on  optimal  algorithm  parameters 
is  minimal,  that  the  algorithm  is  insensitive  to  detection  noise  for  SNR  >  7,  and  that  it  per¬ 
forms  well  for  SNR’s  as  low  as  2.  Furthermore,  it’s  shown  that  introduction  of  pupil- plane 
data  on  average  results  in  a  60%  better  image  reconstruction  from  dynamically  aberrated 
data  than  is  obtained  using  only  focal-plane  and  diversity-plane  data. 

Both  an  Expectation-Maximization  algorithm  and  a  lensless- APDI  approach  are  pre¬ 
sented  for  generating  imagery  directly  from  pupil-plane  polarization  measurements.  Short¬ 
falls  of  these  methods  and  areas  worthy  of  further  consideration  are  identified.  The  use  of 
pupil-plane  polarization  state  measurements  in  place  of  pupil-plane  intensity  measurements 
in  the  APDI  algorithm  is  explored.  A  framework  for  including  polarization  measurements 
into  the  APDI  algorithm  is  demonstrated,  and  an  initial  statistical  model  and  results  are  pre¬ 
sented.  Under  the  developed  implementation,  introduction  of  the  polarization  data  doesn’t 
result  in  better  performance.  Areas  that  may  result  in  better  reconstructions  are  discussed. 
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Phase  Diversity  and  Polarization  Augmented 


Techniques  for  Active  Imaging 

I.  Introduction 

Recent  military  engagements  in  Iraq  and  Afghanistan  have  shown  the  great  impact  that 
space-based  systems  can  have  on  the  battlefield.  The  strongly  asymmetric  advantage 
in  space  enjoyed  by  the  United  States  armed  forces  has  been  cited  as  one  of  the  largest 
factors  in  the  tactical  success  of  Operation  Iraqi  Freedom  and  the  Global  War  on  Terror  in 
general  [16].  With  growing  dependence  on  space-based  assets,  it’s  becoming  more  and  more 
important  to  protect  those  assets  and  ensure  uninterrupted  availability  to  the  war-fighter. 
Gen.  Lance  W.  Ward,  commanding  general  Air  Force  Space  Command,  warned  “[US  control 
of  space]  is  not  a  birthright  or  a  destiny.”  [27]  Recognizing  this  fact,  the  United  States  Air 
Force  has  made  protection  of  space-based  assets  a  high  priority. 

1.1  Motivation 

In  order  to  secure  access  to  and  use  of  space-based  assets,  a  clear  understanding  of 
the  space  environment  and  associated  threats  known  as  Space  Situational  Awareness  (SSA) 
is  required.  To  acquire  the  desired  level  of  SSA,  tools  and  techniques  for  identifying,  track¬ 
ing,  cataloging,  monitoring,  characterizing,  and  assessing  objects  in  orbit  have  been  and 
continue  to  be  developed  and  improved.  One  of  the  most  desirable  analytical  tools  for  char¬ 
acterizing  or  assessing  an  extra-terrestrial  object  is  the  ability  to  generate  high-resolution 
imagery  of  that  object.  High-resolution  imagery  can  provide  a  wealth  of  information  about 
the  object  of  interest,  and  is  critical  for  the  identification  of  unknown  objects  or  monitoring 
of  foreign/friendly  systems.  Imagery  can  provide  information  required  to  positively  iden¬ 
tify  unknown  systems,  identify  payloads  on  potentially  hostile  objects,  and  assist  in  the 
troubleshooting  of  failed  systems. 

While  techniques  exist  for  imaging  satellites  (see  for  example  [29,39,42]),  optimal 
use  of  conventional  methods  cannot  cover  all  of  the  engagement  scenarios  critical  to  SSA. 
Conventional  nighttime  imaging  systems  can  only  image  a  satellite  during  the  brief  period 
near  terminator  when  the  observation  system  is  in  the  earth’s  shadow  and  the  object  is 
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illuminated  by  the  sun.  However,  there  are  many  desirable  engagement  scenarios  that  place 
the  satellite  beyond  terminator  and  in  the  earth’s  shadow.  In  this  case  there  is  no  light 
being  reflected  from  the  object  with  which  to  form  an  image,  and  conventional  imaging 
systems  fail. 

As  an  alternative  approach,  the  unique  properties  of  laser  light  can  be  used  to  overcome 
some  of  these  limitations  and  provide  SSA  information  that  would  otherwise  be  unavailable. 
The  ability  of  lasers  to  generate  intense  focused  optical  radiation  can  be  used  to  overcome 
the  requirement  for  solar  illumination  by  effectively  generating  a  “flash  bulb”  for  the  imaging 
system  that  shifts  it  from  a  passive  to  an  active  imaging  mode.  Unfortunately,  the  same 
coherence  properties  that  make  it  possible  to  project  intense  light  over  long  distances  also 
introduce  complications  unique  to  coherent  light.  This  work  attempts  to  compensate  for 
and  even  leverage  these  coherence  properties  to  generate  high-resolution  imagery  of  exo- 
atmospheric  objects. 

1.2  Overview 

Both  active  and  conventional  imaging  systems  suffer  from  the  effects  of  atmospheric 
turbulence.  As  a  result,  any  system  designed  to  produce  high-resolution  imagery  must  be 
capable  of  compensating  for  or  otherwise  negating  the  aberration  and  associated  blurring 
caused  by  atmospheric  turbulence.  One  technique  for  compensating  for  the  blurring  effects 
of  the  atmosphere  is  the  use  of  phase  diversity  (PD)  image  reconstruction  [23,39].  In 
conventional  PD  imaging,  two  images  are  simultaneously  collected  where  the  first  image 
is  obtained  in  the  conventional  focal  plane  (FP),  and  the  second  is  obtained  in  a  diversity 
plane  (DP)  that  is  defocused  from  the  focal  plane  by  some  known  amount.  The  data  is  then 
post- processed  to  produce  an  estimate  of  both  the  aberration  and  the  object. 

Because  of  the  severity  of  laser  speckle  noise,  a  multi-frame  approach  is  used  to  derive  a 
maximum-likelihood  active  phase  diversity  imaging  (APDI)  reconstruction  algorithm  similar 
to  that  of  Seldin  et  al.  [45].  The  resulting  algorithm  is  essentially  a  hybrid  between  multi¬ 
frame  blind  deconvolution  and  conventional  PD  reconstruction  techniques.  In  addition  to 
the  modifications  required  by  speckle  noise  statistics,  additional  information  made  available 
by  coherent  illumination  is  included  in  the  derivation  to  better  condition  the  reconstruction. 
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Specifically,  a  significant  amount  of  information  about  the  object  being  imaged  is  encoded 
in  the  imaging  system’s  pupil-plane  (PP)  intensity  distribution.  A  statistical  description 
of  the  PP  intensity  is  developed  and  incorporated  into  the  derivation  of  the  reconstruction 
algorithm.  The  completed  APDI  algorithm  is  implemented  and  characterized  using  Monte- 
Carlo  style  simulations. 

Two  scenarios  are  developed.  First,  a  version  of  the  algorithm  tailored  to  static  aber¬ 
rations  is  developed.  This  configuration  covers  situations  where  the  system  suffers  from  an 
unknown  systemic  aberration  or  when  the  time  between  frames  is  much  smaller  than  the 
atmospheric  decorrelation  time.  The  non-tinre-varying  nature  of  the  atmospheric  aberration 
simplifies  the  problem,  making  the  resulting  computations  significantly  faster.  The  second 
scenario  considered  is  the  case  where  atmospheric  turbulence  dynamically  evolves  and  re¬ 
alizations  are  completely  decorrelated  between  data  frames.  This  configuration  represents 
the  most  interesting,  practical,  computationally  expensive,  and  challenging  system. 

The  effects  of  key  parameters  (conditioning  bias  and  convergence  tolerance)  on  algo¬ 
rithm  performance  are  evaluated  for  both  static  and  dynamic  aberrations  by  varying  each 
parameter  and  generating  100  random  data  realizations.  The  residual  mean-squared  error 
(MSE)  between  the  reconstructed  image  and  the  known  truth  object  is  computed  for  each 
realization.  The  effect  of  parameter  variation  on  MSE  is  used  to  determine  optimal  param¬ 
eter  values  for  multiple  scenarios.  Overall  performance  in  the  presence  of  detection  noise  is 
characterized  by  generating  100  random  realizations  with  signal  to  noise  ratio  (SNR)  values 
ranging  SNR  =  2  to  SNR  =  oo  with  10,  20,  30,  40,  and  50  frames  of  data,  then  processing 
them  with  near  optimal  bias  and  convergence  tolerance.  MSE,  mean  computation  time,  and 
average  function  evaluations  are  determined  for  each  reconstruction.  The  effect  of  including 
additional  terms  in  the  reconstructed  turbulence  representation  is  investigated  by  recon¬ 
structing  images  using  5,  10,  15,  20,  30,  50,  and  100  Zernike  modes  and  again  comparing 
MSE,  computation  time,  and  number  of  function  evaluations. 

To  quantify  the  impact  of  adding  PP  data  to  the  algorithm,  versions  of  the  algorithm 
excluding  the  PP  data  and  alternatively  the  DP  data  are  compiled  and  run  along  with 
the  complete  algorithm  on  100  data  realizations.  MSE,  convergence  time,  and  function 
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evaluations  are  computed  for  each  realization  and  compared  between  the  three  algorithm 
configurations. 

As  an  alternative  approach  to  active  imaging,  three  methods  are  developed  for  recon¬ 
structing  images  using  PP  polarization  data.  First,  a  statistical  model  for  the  PP  polariza¬ 
tion  state  is  developed  and  used  to  build  the  framework  for  an  Expectation-Maximization 
(EM)  algorithm  for  reconstructing  the  image  directly  from  PP  polarization  state  measure¬ 
ments.  Second,  the  PP  polarization  data  is  mathematically  shaped  to  fit  within  the  frame¬ 
work  of  the  APDI  algorithm,  and  the  algorithm  is  modified  accordingly.  Finally,  in  an  effort 
to  expand  the  information  available  to  the  initial  APDI  algorithm  and  produce  better  re¬ 
sults,  a  statistical  model  for  polarization  phase  data  is  developed,  and  the  APDI  algorithm 
expanded  to  utilize  PP  polarization  state  measurements  in  place  of  PP  intensity  measure¬ 
ments. 

1.3  Notation  and  Conventions 

To  the  greatest  extent  possible,  consistent  notation  is  used  throughout  the  text.  Where 
possible,  a  given  symbol  retains  its  interpretation  once  defined.  Most  symbols  used  through¬ 
out  the  text  are  included  in  a  list  of  symbols  located  in  the  prefatory  material.  Symbols 
not  defined  in  the  list  of  symbols  either  change  depending  on  context,  or  are  only  used  in 
a  limited  section  of  the  text  and  not  referenced  elsewhere.  A  list  of  abbreviations  is  also 
included  in  the  prefatory  material  for  reference. 

The  general  geometry  used  for  all  notation  in  this  work  is  shown  in  Fig.  1.1.  Propa¬ 
gation  is  taken  to  be  along  the  z  axis.  The  coordinate  set  (£,  rj)  is  used  for  the  object  plane 
(OP),  and  occasionally  for  an  intermediate  mathematical  plane.  The  coordinate  set  (u,v) 
is  used  for  pupil-plane  (PP)  measurements  and/or  a  frequency-domain  coordinate  system. 
The  coordinate  system  (x,  y )  is  used  for  the  FP,  DP,  and  all  data  arrays  that  directly  inter¬ 
face  with  the  APDI  algorithm,  regardless  of  whether  or  not  they  correspond  with  a  physical 
plane. 
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Figure  1.1:  Generalized  geometry  showing  notational  conventions.  (£,77)  are  used  for 

object-plane  coordinates,  (it,  v)  for  pupil-plane  coordinates,  and  (x,  y)  for  data  plane  coor¬ 
dinates.  z  is  used  for  the  coordinate  along  the  direction  of  propagation  in  all  planes. 

1.4  Document  Organization 

The  document  is  organized  as  follows:  Chapter  II  introduces  estimation  and  optimiza¬ 
tion  theory  applicable  to  the  active  imaging  problem  and  image  reconstruction  techniques. 
Chapter  III  describes  applicable  optical  phenomena  including  light  propagation,  coherence 
theory,  and  applicable  statistical  characteristics  of  laser  speckle  patterns.  Chapter  IV  intro¬ 
duces  the  theory  and  models  used  for  imaging  system  performance  and  atmospheric  turbu¬ 
lence.  Chapter  V  contains  the  development  of  the  APDI  algorithm.  Chapter  VI  presents 
the  results  of  Monte  Carlo  analysis  of  the  APDI  algorithm  and  a  discussion  of  their  signif¬ 
icance.  Chapter  VII  builds  the  framework  for  use  of  PP  polarization  state  measurements 
in  image  reconstruction  and  presents  the  initial  results  obtained.  Chapter  VIII  summarizes 
the  contributions  and  conclusions  reached  as  a  result  of  this  work  and  highlights  areas  for 
future  work.  Computer  code  used  to  implement  the  APDI  algorithm  is  included  in  the 
appendices. 


5 


II.  Applicable  Estimation  and  Optimization  Theory 


At  the  core  of  probabilistic  image  reconstruction  techniques  is  estimation  of  unknown 
information  based  on  available  data.  To  do  this,  a  basic  understanding  of  estimation 
and  optimization  theory  is  required.  This  chapter  covers  estimation  theory  and  optimization 
methods  applicable  to  building  the  statistically  based  reconstruction  algorithms  considered. 

2.1  Estimation  Theory 

All  information  covering  estimation  in  this  section  is  derived  or  taken  directly  from  [49] . 
In  the  most  basic  form,  estimation  is  simply  a  well-defined  statistical  “best  guess”  at  desired 
data  (parameters)  based  on  corrupted  or  incomplete  data  (observations)  using  a  set  of 
predefined  criteria  known  as  a  cost  function.  The  basic  estimation  problem  is  composed 
of  four  main  components  shown  in  Fig.  2.1:  the  parameter  space,  probabilistic  mapping, 
observation  space,  and  an  estimation  rule. 

Probablistic 


Figure  2.1:  Basic  elements  of  an  estimator  include  the  parameter  space  containing  the 

parameters  to  be  estimated,  an  observation  space  where  data  is  collected,  a  probabilistic 
mapping  relating  the  parameters  to  observations,  and  an  estimation  rule  linking  observations 
with  parameters. 

The  parameter  space  is  made  up  of  all  possible  parameters  and  contains  the  data 
that  corresponds  to  the  desired  “truth”  result.  The  output  of  the  estimation  problem 
is  an  element  of  the  parameter  space  which  hopefully  corresponds  to  the  “truth.”  The 
probabilistic  mapping  models  whatever  process  corrupts  the  parameters  to  be  estimated, 
and  is  posed  in  the  form  of  probability  distributions  describing  statistical  behavior  of  the 
parameters  and  observations.  The  observation  space  is  composed  of  all  possible  observations, 
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and  may  or  may  not  intersect  the  parameter  space.  The  estimation  rule  is  the  mapping  used 
to  map  elements  of  the  observation  space  back  into  the  parameter  space  and  thereby  form 
an  estimate  of  the  desired  parameter(s). 

The  probabilistic  mapping  is  frequently  a  function  of  the  operating  environment,  and 
cannot  in  general  be  altered  to  aid  the  development  of  an  estimator.  Similarly  the  obser¬ 
vation  and  parameter  spaces  are  not  typically  customizable  for  a  given  problem.  The  task 
then  becomes  development  of  an  adequate  estimation  rule. 

2.1.1  Bayesian  Estimation.  One  of  the  first  considerations  when  developing  an 
estimation  rule  is  what  exactly  must  be  accomplished.  For  example,  one  could  conceive 
a  situation  such  as  monitoring  for  a  toxic  substance  in  the  environment  where  overesti- 
rnation  of  a  parameter  might  be  relatively  acceptable,  but  underestimating  it  could  have 
disastrous  consequences.  Alternatively,  when  monitoring  something  like  oxygen  concentra¬ 
tion  in  breathable  air  the  converse  situation  could  be  true,  and  overestimation  could  be 
deadly.  Given  these  examples,  it  should  be  clear  that  no  one  estimation  rule  will  work  for 
all  applicable  problems. 

To  incorporate  requirement  variability  into  the  estimation  rule,  a  cost  function  C  is 
introduced.  In  general,  a  cost  function  is  a  scalar  valued  function  of  both  the  parameters 
and  the  observations  whose  value  is  proportional  to  the  “badness”  of  a  given  combination 
of  the  two.  Frequently,  the  cost  function  can  be  written  as  a  function  of  the  scalar  absolute 
error  Er ,  defined  as 

Er(d,o)  =  \\d{d)  -  o||2  (2.1) 

where  o  is  the  parameter  vector,  d  is  the  observation  vector,  b  is  the  estimate  of  o  generated 
by  the  estimation  rule,  and  ||-||2  is  the  standard  Euclidean  norm. 

While  the  cost  function  can  be  tailored  to  any  specific  application,  the  two  most 
commonly  used  cost  functions,  shown  in  Fig.  2.2,  are  the  mean-square  error  cost  function 
GVt.se ,  defined  as 


Cmse  =  [Er(d,o}\ 2 

=  {d-of{d-o) 
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(2.2a) 

(2.2b) 


which  increases  the  cost  quadratically  as  the  estimate  moves  further  away  from  the  true 
parameters;  and  the  uniform  cost  function  Cu  defined  as 


0,  Er  (6,  o)  <  y 

< 

1,  Er  (o,  o)  >  y 


(2.3) 


which  assigns  an  equal  cost  to  all  estimates  outside  of  a  neighborhood  of  width  A  centered 
about  the  true  parameter  vector,  and  assigns  no  cost  to  estimates  within  the  neighborhood. 


(a)  Mean-Square  Error 


(b)  Uniform 


Figure  2.2:  One-dimensional  mean-square  error  and  uniform  cost  functions  for  Bayesian 

estimation. 


Once  the  cost  function  has  been  selected,  a  Bayesian  estimation  rule  can  be  formed 
given  that  the  joint  probability  of  the  parameters  and  observations  p(o,  d )  is  known.  The 
Bayesian  estimation  rule  is  developed  by  minimizing  the  risk  1Z  defined  as 


TZ=E  [C(o,o)\ 
r  r  °° 

=  C(o,  o)p(o,  d)dddc 

J  J  —OO 


(2.4) 


where  E  [•]  represents  statistical  expectation. 

As  an  example,  the  Bayesian  estimation  rule  for  a  mean  square  error  cost  function 
can  be  found  by  substituting  (2.2)  into  (2.4)  and  minimizing 


n 


mse 


(6  —  o)T  (6  —  o)  p(o,  d)dddo 


(2.5) 
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Noting  that  p(o,d)  =  p(o\d)p(d ),  (2.5)  can  be  written 


'R'm.stp.  — 


/oo  /*oo 

p(d)  /  (6  —  o)T(o  —  o)p(o|d)dodcT 

-OO  J  —  OO 


(2.6) 


Because  both  p(d)  and  the  inner  integral  of  (2.6)  are  non-negative,  minimizing  the  inner 
integral 


/ 

J  — C 


(6  —  q)T(6  —  o)p{o\d)do 


(2.7) 


will  also  minimize  7 Zmse.  The  value  of  6  that  minimizes  this  integral  is  then  the  minimum- 
mean-square  estimate  dms . 

To  minimize  the  inner  integral,  (2.7)  is  differentiated  with  respect  to  6  and  the  result 
set  equal  to  zero.  Using  standard  definitions  from  vector  calculus,  the  derivative  of  (2.7)  is 
given  by 


H  r°°  _  r°°  H 

—  /  (d  —  o)T(d  —  d)p(o\d)do  =  /  —  (6  —  o)T(d  —  o)p{o\d)do 

bo  J —oq  J — oo  bo 


=26 


/OO  /*oo 

p(o\d)do  —  2  /  dp(o|cT)dd 

-oo  J  —oo 


(2.8a) 

(2.8b) 


Setting  (2.8)  equal  to  zero  and  noting  that  f^°oop(o\d)dd  =  1,  the  best  estimate  dms  is  given 
by 

/OO 

op(o\d)do  (2.9) 

-OO 

which  is  the  expected  value  of  the  conditional  probability  distribution  function  p(o\d). 


2.1.2  Maximum  A  Posteriori  Estimation.  Another  noteworthy  example  is  the 
Bayesian  estimator  that  results  from  the  uniform  cost  function  of  (2.3)  when  A  is  made 
arbitrarily  small.  This  estimation  rule  is  given  the  special  name  maximum  a  posteriori 
(MAP)  estimation.  The  risk  for  the  associated  cost  function  is  given  by 


n 


map  — 


p(d)dd 


1  - 


ro+ A/2 

/  p(d\d)do 
Jo-  A/2 


(2.10) 


As  before,  the  integral  can  be  minimized  by  minimizing  the  inner  term,  in  this  case  the  term 
in  the  brackets.  For  an  arbitrarily  small  A,  it  can  be  seen  that  the  minimum  of  the  term 
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in  brackets  is  achieved  when  the  a  posteriori  probability  distribution  function  p(o\d)  is  at  a 
maximum.  The  problem  them  becomes  determination  and  maximization  of  the  distribution. 

Frequently,  the  distributions  have  the  parameter  of  interest  in  an  exponent,  and  it 
becomes  convenient  to  work  with  the  logarithm  of  the  distribution.  Given  that  the  loga¬ 
rithm  is  a  monotonically  increasing  function,  maximization  of  the  logarithm  is  equivalent 
to  maximizing  the  distribution  function.  Therefore  at  the  MAP  estimate  value 

5  In  p(o\d) 
do 

Bayes  rule  can  then  be  used  to  separate  (2.11)  into  a  priori  knowledge  and  observation 
knowledge. 

=  —(in  p(d\o)  +  In  p(ofj  =0  (2-12) 

O—O  map  0=Omap 

where  constant  terms  have  been  dropped  and  £(•)  is  known  as  the  log-likelihood  function. 


=  0 


(2.11) 


2.1.3  Maximum  Likelihood  Estimation.  Unfortunately,  the  parameter  distribution 
p{o)  is  frequently  unknown.  Without  prior  information  about  the  parameter  distributions, 
the  best  that  can  be  done  is  to  assume  that  the  parameters  are  uniformly  distributed 
throughout  the  parameter  space.  Applying  this  to  the  MAP  estimation  described  in  the 
previous  section,  p(o)  is  constant  and  can  be  dropped.  This  results  in  the  maximum  likeli¬ 
hood  (ML)  estimate  given  by 


dL{o) 


do 


O — Omap 


d  In  p(d\o) 


do 


O — Omap 


=0 


(2.13) 


2.1.4  Expectation  Maximization  Algorithm.  In  many  cases,  a  closed  form  for 
the  maximum-likelihood  estimate  cannot  be  found  because  either  the  statistical  model  for 
the  data  contains  parameters  that  are  unknown,  or  the  likelihood  function  cannot  be  easily 
differentiated.  In  these  cases,  an  alternative  approach  must  be  taken.  If  something  is  known 
about  the  statistics  of  the  missing  data,  or  the  likelihood  function  can  be  simplified  by  adding 
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additional  “hidden”  data,  the  Expectation-Maximization  (EM)  algorithm  [5,13,15,36,42] 
can  be  used  to  form  an  estimate. 

Within  the  framework  of  the  EM  algorithm,  the  measured  data  d  is  termed  the  “in¬ 
complete”  data  (ID),  and  is  generated  from  a  subset  of  “complete”  data  (CD)  that  includes 
the  unknown  or  “hidden”  data  d.  The  problem  is  modeled  in  terms  of  the  joint  probabil¬ 
ity  density  function  of  the  complete  data  conditioned  on  the  parameters  o  that  are  to  be 
estimated 

p(d,  d\o)  =  p(d\d,  o)p(d\o)  (2-14) 

In  the  event  that  the  “hidden”  data  is  artificially  inserted  into  the  problem,  the  marginal 
distribution  p(d\d,o)  is  chosen  such  that  it  puts  the  joint  density  function  into  a  form 
that  is  both  convenient  for  calculation  and  statistically  consistent  with  the  measured  data. 
Otherwise,  the  marginal  distribution  is  determined  by  the  nature  of  the  problem. 

Once  the  joint  probability  distribution  function  has  been  determined,  the  CD  log- 
likelihood  function  for  o  can  be  formed  such  that 

L(o\d,  d)  =  lnp(d,  d, \o)  (2-15) 


With  the  CD  log-likelihood  function  defined,  the  statistical  model  for  the  system  is  complete 
and  the  algorithm  can  proceed.  As  the  name  suggests,  the  algorithm  consists  of  two  com¬ 
ponents;  an  expectation  component,  and  a  maximization  component.  In  the  expectation 
step,  the  CD  log-likelihood  function  given  the  incomplete  data  and  the  current  estimate  of 
the  parameters  is  expectated  over  the  “hidden”  data  such  that 


Q(o,6«)  =  E  \L(o\d,d)\d,oW 


=  E 


In 


Idev 


{p(d,  d|d)|  |  d.  oW 
p(d\ d,  o« )  In  |p(d*,  d|  o)  |  d d 


(2.16a) 

(2.16b) 

(2.16c) 


where  6^  is  the  current  estimate  of  the  parameter,  p(d\d,  6®)  is  the  marginal  density  of 
the  “hidden”  data  conditioned  on  the  incomplete  data  and  current  parameter  estimate, 
and  V  is  the  space  over  which  d  is  defined.  One  common  problem  is  that  the  marginal 
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density  of  the  “hidden”  data  is  unobtainable.  In  these  cases,  p(d\d,  oW)  is  often  replaced  by 
p(d,  djoW)  =  p(d\d,  d^)p(d\d^)  [5]. 

The  second  step  of  the  EM  algorithm  is  the  maximization  step  in  which  the  expected 
value  of  the  CD  log-likelihood  function  Q(o,d is  maximized  with  respect  to  o  using  any 
appropriate  optimization  technique  to  generate  an  updated  estimate  of  the  parameter  of*+1l 
such  that 

6(t+i)  =  argmax  Q{o,  ow)  (2-17) 

6 


2.2  Numerical  Optimization  Methods 

Frequently  the  maximization  involved  in  forming  the  MAP/ML  estimates  or  the  EM- 
iterates  cannot  be  efficiently  accomplished  analytically  due  to  the  size  of  the  problem  or 
complex  nature  of  the  objective  function  being  maximized.  In  these  cases,  iterative  numer¬ 
ical  solution  methods  can  be  used  to  generate  a  sequence  of  estimates  that  converge  to  the 
optimal  solution.  The  iterative  solution  techniques  in  this  section  are  adapted  from  [21] 
chapter  4,  and  are  used  to  minimize  a  scalar-valued  objective  function  F(o)  such  that 

6  =  argmaxl?(dj  (2-18) 

6 

To  utilize  these  methods  for  maximization,  the  objective  function  is  simply  multiplied  by 
-1. 

Nearly  all  numerical  optimization  techniques  are  based  on  the  same  basic  algorithm 
composed  of  the  following  major  steps: 

1.  Develop/define  an  initial  guess  for  the  estimate  that  is  consistent  with  the  problem. 

2.  Check  the  current  estimate  dk  of  the  solution  6  for  convergence.  If  the  convergence 
criteria  are  satisfied,  stop  the  iteration. 

3.  Compute  the  “optimal”  search  direction  given  6*.  and  the  objective  function  F(o). 

4.  Compute  the  “optimal”  step  size  op. 

5.  Update  the  estimate  dk+i  =  dfc  +  otkPk  to  produce  the  next  iterate. 
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6.  Repeat,  beginning  at  step  2. 

Determination  of  convergence  criteria  and  the  definition  of  “optimal”  are  based  on  the 
specific  method  being  used. 

Additional  terms  that  are  frequently  used  are  the  gradient  vector  g  (analogous  to  the 
first  derivative  for  a  univariate  function) 


VF  =  g(o) 


(  dF  \ 

doi 


9F_ 

\  don  / 


(2.19) 


and  the  Hessian  G  (analogous  to  the  second  derivative  for  the  univariate  case) 


V2F  =  G(o) 


do\don 


d2F  \ 
doidon 


d2  F 

~dc%  / 


For  convenience,  g(dk)  and  G(6fc)  will  be  written  gk  and  Gk  respectively. 


(2.20) 


2.2.1  Newton’s  Method.  For  purposes  of  iterative  optimization,  it’s  usually  suffi¬ 
cient  to  approximate  an  arbitrary  objective  function  F(dk  +  o-kPk)  as  a  quadratic  function 
through  a  Taylor-series  expansion  about  such  that  in  a  small  region  around  <%, 

Q ,2 

F(dfc  +  akpk )  «  Fk  +  akgkpk  +  —pkGkpk  (2.21) 

Given  this  quadratic  form,  the  minimum  will  occur  when  pk  is  a  minimum  of  the  quadratic 
function 

a2 

<S>(pk)  =  OLkQkPk  +  ypf  G  kpk  (2.22) 

Differentiating  and  setting  the  result  to  zero,  the  minimum  then  satisfies 


O^kGkPk  —  2k 


(2.23) 


Setting  ak  =  1  results  in  Newton’s  method  for  a  multivariate  function.  If  F  is  exactly 
quadratic  and  Gq  is  positive  definite,  Newton’s  method  converges  to  the  global  minimum, 
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F(d k  +  Pk)  =  F(o),  in  a  single  step.  Provided  the  approximation  of  (2.21)  is  relatively 
accurate,  the  initial  starting  point  do  is  not  too  far  from  the  minimum  point  o,  and  Gfc(o)  is 
positive  definite,  Newton’s  method  will  converge  quadratically  to  the  true  minimum  F(d). 

2.2.2  Quasi-Newton’s  Methods.  In  cases  where  Newton’s  method  fails  to  converge 
to  a  correct  solution,  alternative  solution  techniques  can  be  used.  For  example,  if  the 
Hessian  Gju(d)  is  not  positive  definite,  Quasi-Newton’s  methods  can  be  used  wherein  a 
positive  definite  “related  Hessian”  G^,  closely  related  to  G^,  is  formed  at  each  iteration. 
Generally,  G&  is  formed  using  some  form  of  factorization  on  G&.  The  search  direction  pk  is 
then  determined  by  solution  of  the  system 

GkPk  =  Qk  (2.24) 

The  two  most  common  factorizations  used  for  forming  G^  are  the  spectral  decompo¬ 
sition  given  by 

G  =  XJAXJh  (2.25) 

where  U  is  a  unitary  matrix  of  eigenvectors  and  A  is  a  diagonal  matrix  of  the  eigenvalues 
of  G,  and  a  modification  of  the  Cholesky  factorization  given  by 

G  =  LDL^  (2.26) 

where  L  is  a  unit  lower-triangular  matrix  and  D  is  a  diagonal  matrix.  These  factorizations 
both  provide  clear  indications  of  whether  or  not  a  given  matrix  is  positive  definite,  and 
what  must  be  done  to  make  it  so.  The  “related  Hessian”  is  then  formed  by  making  the 
“smallest”  changes  possible  to  the  true  Hessian  that  will  ensure  positive  definiteness.  For 
example,  if  the  modified  Cholesky  factorization  results  in  a  diagonal  matrix  with  negative 
entries,  those  negative  entries  are  replaced  with  an  appropriately  small  positive  number. 
The  magnitude  of  the  replacement  is  selected  based  on  the  maximum  acceptable  condition 
number.  Once  the  related  Hessian  has  been  formed,  the  iterations  proceed  as  with  standard 
Newton’s  method  solutions  except  that  a  new  related  Hessian  is  formed  at  each  iteration. 
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2.2.3  Other  Common  Methods.  In  cases  where  both  Newton’s  and  quasi-Newton’s 
methods  fail  to  converge  to  a  solution,  alternative  solution  methods  including  gradient- 
descent,  conjugate  gradient,  multi-grid,  and  other  related  numerical  solution  methods  can 
be  used.  For  descriptions  and  applications  of  these  methods  see  for  example  [14,21], 

2.2.4  Constrained  Optimization.  In  many  cases  physical  processes  result  in  con¬ 
straints  on  the  solution  vector  0  that  must  be  satisfied  for  it  to  be  a  valid  solution.  To 
accommodate  these  situations,  Lagrange  multipliers  can  be  used  to  incorporate  the  con¬ 
straints  into  the  problem  [18].  The  modified  problem  can  then  be  passed  to  one  of  the 
previously  discussed  solution  methods  to  find  the  optimal  solution  given  the  constraints. 

As  a  motivating  example,  define  the  objective  function  as 


F(o)  =  ohMo  (2.27) 

where  H  is  the  hermitian  transpose  and  M  is  a  hermitian  matrix,  and  constrain  the  solution 
such  that 

oHo  =  n  (2.28) 

Without  applying  the  constraints,  (2.27)  can  be  minimized  by  taking  the  derivative  with 
respect  to  o  and  setting  the  result  equal  to  zero,  resulting  in 


Mo  =  0  (2.29) 

If  M  is  non-singular,  the  minimizing  solution  is  then  o  =  0.  However,  the  problem  con¬ 
straints  eliminate  this  solution  from  consideration,  and  an  alternative  approach  has  to  be 
taken. 

To  accommodate  constraints,  the  Lagrangian  is  formed  using  the  objective  function 
F(o),  the  constraints,  and  a  vector  of  Lagrange  multipliers  such  that 

L(d,\l)  =  F(o)  +  Xfc(o)  (2.30) 
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where  F(o)  is  as  defined  in  (2.27),  A;  is  a  vector  of  Lagrange  multipliers,  and  c(o)  is  a 
vector-valued  function  that  when  the  constraints  are  met  satisfies 

c(o)  =  0  (2.31) 

where  0  is  a  vector  of  zeros.  L(o,  A/)  is  then  minimized  using  any  applicable  method  for 
unconstrained  problems,  and  A;  is  selected  to  satisfy  (2.31)  exactly.  If  (2.31)  is  satisfied  for 
multiple  values  of  A i,  the  A i  that  results  in  the  minimum  value  of  the  objective  function 
F(o)  is  selected. 

Considering  the  objective  function  of  (2.27)  with  the  associated  constraint  of  (2.28), 
the  constraint  function  c(x)  can  be  written 

c(o)  =  n-  oHo  (2.32) 

where  n  is  the  number  of  elements  in  o.  In  this  case,  c(o)  is  a  scalar-valued  function,  reducing 
the  vector  of  multipliers  to  a  single  scalar  A/  =  A ;.  Applying  this  to  (2.30)  results  in  the 
expression 

L(o,  A i)  =  oHMo  —  XioHo  +  n  (2.33) 

Differentiating  and  setting  this  equal  to  zero  results  in 

Mo  =  A  id  (2.34) 

which  is  simply  an  eigenvalue  problem.  The  eigenvector  corresponding  to  the  smallest 
eigenvalue  then  minimizes  F{o)  subject  to  the  constraints  of  (2.28). 

With  a  sufficient  description  of  the  required  statistical  estimation  methods  complete, 
the  next  step  in  building  an  image  reconstruction  algorithm  is  development  of  a  system 
model.  However,  before  a  statistically-based  system  model  can  be  built,  an  understanding 
of  the  applicable  physical  processes  is  required,  and  will  be  developed  in  Chapters  III  and 
IV. 
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III.  Optical  Phenomena 


Active  imaging  techniques,  including  those  to  be  considered  here,  rely  heavily  on  an 
understanding  of  certain  fundamental  properties  of  light  including  polarization,  prop¬ 
agation  and  diffraction,  and  coherence.  This  chapter  provides  a  brief  overview  of  the  optical 
phenomena  applicable  to  the  active  imaging  problem.  The  chapter  is  broken  up  into  four 
main  areas:  the  first  covers  polarization  and  depolarization  phenomena,  the  second  covers 
propagation  and  diffraction  of  light,  the  third  introduces  the  coherence  theory  that  will  be 
used,  and  the  fourth  introduces  laser  speckle  statistics  that  are  integral  to  the  active  imaging 
problem. 

3.1  Polarization  and  Associated  Effects 

As  a  form  of  electro-magnetic  radiation,  the  propagation  of  light  is  governed  by 
Maxwell’s  equations,  which  in  differential  form  are  given  by  [51] 


-  BE 

V  xE  +  —  =  0 
ot 

(3.1a) 

-  Bn 

v  x  77  — —  =  J 
dt 

(3.1b) 

V  ■  D  =  p 

(3.1c) 

V-B  =  0 

(3.  Id) 

where  E  is  the  electric  field  vector,  B  is  the  magnetic  flux  density,  TC  is  the  magnetic  field 
vector,  D  is  the  electric  displacement,  J  is  the  free  current  density  in  the  material,  and  p 
is  the  free  charge  density  in  the  material.  Given  the  vector  nature  of  these  equations,  it’s 
clear  that  any  electro- magnetic  field  will  have  intrinsically  vector  qualities.  Specifically,  in 
free-space  the  wave  vector,  electric  field  vector,  and  the  magnetic  field  vector  are  jointly 
orthogonal  and  form  the  basis  of  a  right-handed  coordinate  system  [8] .  This  characteristic 
translates  into  a  “directional”  property  of  the  field  that  can  be  described  by  assigning 
polarization  directions  and  writing  the  field  in  terms  of  the  components  along  each  of  the 
assigned  polarization  directions. 

3.1.1  Polarization.  If  the  media  through  which  the  radiation  is  passing  is  homo¬ 
geneous,  linear,  isotropic,  and  non-dispersive,  the  solution  to  (3.1)  will  satisfy  the  pair  of 
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vector  Helmholz  equations  given  by 


h2  d2E 

(3.2a) 

v2e- 

c 2  dt 2 

h2  d2U 

(3.2b) 

V27? - 

=  0 

c2  dt2 

where  h  is  the  index  of  refraction  and  c  is  the  speed  of  light.  Because  both  E  and  7i  obey  the 
same  vector  wave  equation,  an  identical  scalar  wave  equation  can  be  written  to  completely 
describe  each  vector  component  [25] 


V2t/- 


h2  d2JJ 

c 2  dt2 


(3-3) 


where  U  can  represent  any  of  the  field  components.  This  implies  that  as  long  as  the  as¬ 
sumptions  of  a  homogeneous,  isotropic,  linear,  non-dispersive  media  are  accurate,  the  field 
can  be  described  by  considering  each  component  independently  using  scalar  theory. 

One  consequence  of  this  property  is  that  specification  of  the  electric  field  in  free  space 
can  be  used  to  completely  describe  the  magnetic  field.  Therefore,  only  the  electric  field  will 
be  considered.  Defining  the  polarization  directions  of  a  field  propagating  in  the  z  direction 
to  be  along  x  and  y ,  the  electric  field  can  be  written 


E  =  Exi  +  Eyj 


(3.4) 


where  i  and  j  are  unit  vectors  along  the  x  and  y  directions  respectively.  Ex  and  Ey  are  the 
field  components  projected  onto  i  and  j  given  by 


Ex  =A\  cos  (cut  —  kz  +  9 1) 
Ey  =A-2  cos  (cut  —  kz  +  62) 


(3.5) 


where  A\  and  A2  are  real  scalar  amplitudes,  k  is  the  wavenumber,  cu  is  the  angular  frequency, 
and  9 1  and  62  are  arbitrary  phases  in  the  interval  (— n,  7r] . 
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To  suppress  the  time  and  z  dependence  of  (3.5),  they  are  written  in  the  form 


Ex 

—  =  cos(wf  —  kz )  cos(0i)  —  sin(o;t  —  kz )  sin(0i) 
A\ 

E 

—r -  =  cos(u/f  —  kz)  cos(02 )  —  sin(u/i  —  kz)  sin(02) 
A2 


(3.6) 


leading  to 


E  E 

— -  sin(02)  — sin(0i)  =cos {ut  —  kz)  sin(02  —  0i) 
A\  A  2 

E  E 

-A  cos ($2)  — -7 -  cos(0i)  =sin {ut  —  kz)  sin(02  —  0i) 
A\  A2 


(3.7) 


which  are  then  squared  and  added  to  produce 


E, 


A, 


E, 


-A]  +  lwl  -2 


A-. 


,  EXEy 

‘MM 


cos(0i2)  =  sin2  (0i2 ) 


(3.8) 


where  $12=  02  —  0\.  Equation  (3.8)  describes  an  ellipse  that  has  been  rotated  in  the  plane 
by  an  angle  given  by 

9/7  F! 

(3.9) 


t  2ExEy 
^  ^ COS  012 


El  +  El 


and  is  known  as  the  polarization  ellipse  [8],  illustrated  in  Fig.  3.1. 


Figure  3.1:  The  polarization  ellipse  describes  the  temporal  evolution  and  polarization 

state  for  elliptically  polarized  radiation.  The  eccentricity  and  angle  describe  the  relationship 
between  the  x—  and  y— polarized  fields.  The  tip  of  the  electric  field  vector  traces  the  ellipse 
with  an  angular  frequency  of  ui. 
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If  the  time-dependent  nature  of  the  field  is  again  considered,  it  can  be  seen  that  the 
tip  of  the  electric  field  vector  traces  the  ellipse,  rotating  with  an  angular  frequency  of  u  and 
the  direction  or  sense  of  rotation  depending  on  @12  [8].  For  0  <  612  <  7r,  the  electric  field 
vector  would  appear  to  rotate  in  a  clockwise  fashion  when  viewing  the  (x,  y )  plane  from  a 
point  on  the  positive  ^  axis.  By  convention  in  the  optical  community  this  sense  of  rotation 
is  called  a  right-handed  polarization.  When  7 r  <  d\2  <  2ir  the  electric  field  vector  would 
appear  to  rotate  in  a  counter-clockwise  manner  and  be  termed  a  left-handed  polarization.  It 
should  be  noted  that  these  conventions  for  right-  and  left-handed  polarizations  are  reversed 
from  those  normally  used  by  the  RF-microwave  community. 

One  special  case  of  interest  is  when  the  polarization  state  randomly  changes  on  a 
time  scale  that  is  fast  compared  with  the  time-resolution  of  the  detection  system,  or  when 
the  field  is  composed  of  a  superposition  of  many  randomly  polarized  components.  This  is 
referred  to  as  unpolarized  radiation,  and  is  characterized  by  a  lack  of  observable  “preferred 
direction.” 

3.1.2  Stokes  Parameters.  With  the  polarization  ellipse  and  sense  of  rotation  spec¬ 
ified,  the  description  of  a  fully  polarized  wave  of  frequency  co  traveling  through  free-space 
along  the  z  direction  is  complete.  However,  the  high  frequencies  and  short  wavelengths  of 
light  make  it  impossible  to  directly  measure  the  amplitude  and  phase  of  the  incident  field. 
Conventional  detection  schemes  rely  on  detecting  the  incident  intensity/photon  flux.  Addi¬ 
tionally,  light  that  is  only  partially  polarized  needs  additional  information  to  satisfactorily 
describe  it. 

One  scheme  that  adequately  deals  with  these  complications  is  the  characterization  of 
the  light  in  terms  of  the  Stokes  vector  S,  defined  by  [8] 

S=[SQ,S1,S2,S3]  (3.10) 
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where 


50  — A2  +  A% 

51  =A2  -  A2 

(3.11) 

$2  =2A}A2  cos(0!2) 

S3  =2AiA2  sin(0i2) 

Using  a  small  amount  of  mental  gymnastics,  algebra,  and  intuition,  it  can  be  seen  that  So  is 
proportional  to  the  total  intensity  of  the  light,  Si  is  the  predominance  of  the  x  polarization 
over  the  y  polarization,  S2  is  the  predominance  of  left-450  linear  over  right-450  linear  po¬ 
larization,  and  S3  is  the  predominance  of  right-circular  over  left-circular  polarization.  With 
the  aid  of  linear  polarizers  and  a  wave  plate,  all  of  these  quantities  can  be  directly  measured 
using  conventional  detection  methods. 

It  should  be  noted  that  the  polarization  state  of  a  fully  polarized  wave  is  completely 
described  by  three  parameters,  A\ .  A 2,  and  6*12;  and  that  the  Stokes  parameters  provide 
four.  In  the  event  that  the  wave  is  fully  polarized,  Sq  =  Sf  +  S|  +  S|  and  one  of  the 
parameters  is  degenerate.  However,  when  the  wave  is  partially  polarized,  the  relationship 
between  So  and  the  remaining  parameters  is  no  longer  valid.  The  extra  information  now 
available  makes  it  possible  to  characterize  partially  polarized  light  as  well. 

Partially  polarized  light  can  be  broken  down  into  the  sum  of  fully  polarized  and  fully 
unpolarized  components.  In  this  case,  the  stokes  parameters  can  be  written  as 


Su=  [(1-P)S0, 0,0,0] 
Sp=  [7>So,Si,S2,S3] 


(3.12) 


where  the  degree  of  polarization  V  is  the  ratio  of  the  polarized  intensity  with  the  total 
intensity,  and  can  be  written 

v  sl  +  sl  +  sl 
Si 
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3.1.3  Depolarization.  Because  one  of  the  imaging  techniques  being  considered 
relies  heavily  on  the  changes  to  the  polarization  state  after  reflection  from  a  distant  object,  an 
area  of  particular  interest  is  the  variation  induced  in  the  polarization  state  by  the  reflecting 
object.  The  changes  in  the  polarization  state  stem  primarily  from  the  same  fundamental 
process,  manifested  in  three  forms:  change  after  reflection  from  a  smooth  surface  induced 
by  the  difference  in  material  reflectivity  for  the  two  principle  polarization  components, 
depolarization  due  to  multiple  reflections,  and  depolarization  due  to  scattering  from  rough 
surfaces  [4]. 

The  simplest  depolarization  mechanism,  and  the  foundation  for  all  the  applicable 
depolarization  mechanisms,  is  caused  by  the  variation  between  reflection  coefficients  for  the 
polarization  components  parallel  (TM)  and  perpendicular  (TE)  to  the  plane  of  incidence. 
For  reflection  of  a  wave  from  an  interface  between  two  materials  with  (possibly)  complex 
indices  of  refraction  hi  and  h 2,  the  amplitude  reflection  coefficients  are  given  by  the  Fresnel 
equations  [8] 


hi  cos  9i  —  fi2  cos  6t 
hi  cos  9i  +  fi2  cos  Ot 
n 2  cos  Qi  —  hi  cos  9t 
h 2  cos  Qi  +  hi  cos  Ot 


(3.14) 


where,  0-L  is  the  angle  between  the  incident  propagation  vector  and  the  interface,  and  Ot  is 
given  by  the  law  of  refraction  such  that 


hi  sin  Qi  =  fi2  sin  Ot 


(3.15) 


For  the  cases  of  interest,  the  reflecting  interface  is  between  free-space  where  hi  =  1,  and  an 
arbitrary  material  where  |h2 1  >  1- 

As  typical  examples,  the  amplitude  reflection  coefficients  and  reflection  phases  for 
500rem  radiation  reflected  from  an  air- fused  silica  interface  with  hi  =  1  and  fi2  =  1.45  [26], 
and  an  air-unoxidized  aluminum  interface  with  hi  =  1  and  fi2  =  0.62(1  +  j'4.8)  [38]  are 
shown  in  Fig.  3.2  and  Fig.  3.3.  Unless  the  incident  light  is  completely  TM  or  TE  polarized, 
the  incident  field  will  have  components  containing  both  TM  and  TE  components.  As  shown 
in  Fig.  3.2  and  Fig.  3.3,  the  reflection  coefficients  differ  between  TM  and  TE  polarizations. 
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The  change  in  relative  amplitudes  for  each  polarization  component  after  reflection  result  in 
different  major  and  minor  axes  for  the  polarization  ellipse,  and  the  change  in  phase  after 
reflection  shifts  its  rotation  angle. 


Amplitude  Reflection 


Reflection  Phase 


Figure  3.2:  Reflection  amplitude  and  phase  for  500nm  radiation  reflected  from  an  air 

(hi  =  1)  fused  silica  (h.2  =  1.45)  interface.  The  difference  between  polarizations  results  in  a 
change  in  the  polarization  state  of  the  reflected  radiation. 


If  the  observed  field  is  made  up  of  contributions  from  reflections  off  of  several  different 
materials,  or  of  reflections  from  differently  oriented  interfaces,  the  resulting  field  will  be 
partially  depolarized.  The  degree  of  depolarization  is  then  dependent  on  the  number  of 
contributions  and  the  degree  of  variance  between  the  polarization  of  the  contributions. 
When  reflection  depolarization  is  taken  to  the  micro-scale  to  describe  scattering  off  of  a 
rough  surface,  the  large  number  of  contributions  can  significantly  depolarize  the  field.  For 
a  complete  discussion  of  depolarization  via  rough  surface  scattering,  see  chapter  8  of  [4] . 


3.2  Scalar  Diffraction  Theory 

An  additional  consequence  of  the  scalar  Helmholz  equation  of  (3.3)  is  that  provided 
the  approximations  of  a  linear,  homogeneous,  isotropic  media  are  valid,  diffraction  and 
propagation  of  the  vector  fields  can  be  described  using  scalar  theory  for  each  polarization 
component  independently.  Using  this  result,  Green’s  theorem  can  be  used  to  solve  (3.3) 
for  a  given  component  U  of  the  vector  field  using  diverging  spherical  waves  for  the  Green’s 
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Amplitude  Reflection  Reflection  Phase 


Figure  3.3:  Reflection  amplitude  and  phase  for  500nm  radiation  reflected  from  an  air 

(hi  =  1)  aluminum  (77-2  =  0.62(1  +  j4.8))  interface.  The  difference  between  polarizations 
results  in  a  change  in  the  polarization  state  of  the  reflected  radiation 


function  [25].  Doing  this  leads  to  the  most  general  expression  describing  scalar  diffraction 
of  monochromatic  radiation  between  two  planes  given  by 

uix,y)  =  J_  rr  „(£,„)  2fEhhalxWd{d,  (3.16) 

JAJJ- 00  r01 

where  k  =  %(■)  is  an  “obliquity”  factor  (typically  cos(0)),  and  9  is  the  angle  be¬ 

tween  a  vector  normal  to  the  source  wavefront  at  the  point  (£,  77)  and  the  vector  of  length 
nn  =  (x  —  £)2  +  (y  —  rj)2  connecting  the  point  (£,17)  in  the  source  plane  with  the 

point  (x,y)  in  the  observation  plane  [25]. 

Approximating  %  (9)  ~  the  diffraction  equation  becomes 

V[x,v)  -±fr  <«*,  (3.17) 

JaJJ- 00  r0i 
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Further  simplification  comes  by  using  the  binomial  approximation  y/1  +  e  ~  1  +  §  to  ap¬ 
proximate  r 01  in  the  exponential  as 


ro  1  =z\  1  + 


x-i 


+ 


y  -  n 


z 


(x  -  £)2  +  {y-  y? 

2z 


(3.18a) 

(3.18b) 


and  tq i  ~  z  otherwise.  The  end  result  of  these  simplifications  is  the  Fresnel  diffraction 
integral  given  by 


U(x,y )  =  J^ej^z{x2+y2)  J f  {u(^,v)ej^2+r>2^e-j^M+yri)d^dri 


(3.19) 


An  additional  simplification  can  be  made  when  the  distance  between  the  source  and 
observation  planes  is  sufficiently  large.  Known  as  the  Fraunhoffer  diffraction  integral,  it’s 
valid  when  the  product  of  the  object’s  maximum  dimension  and  the  wavelength  is  sufficiently 
small  compared  with  the  separation  between  the  object  and  observation  planes,  i.e.  when 

21  »  \k(imax  +  Vmax)  (3-20) 

where  ^max  and  Vmax  are  the  maximum  linear  dimensions  of  the  object  along  the  £  and  v  axes 
respectively  [25].  In  this  case,  the  quadratic  phase  term  inside  the  integral  is  approximately 
unity  over  the  region  of  interest  and  can  be  safely  ignored,  resulting  in  the  Fraunhoffer 
diffraction  equation  given  by 

U(x,y)  =  %--ej^x2+y2'>  U^,v)e~j^+yv)  d£dy  (3.21) 

J  i/  i/  —  oo 

which  is  valid  for  the  cases  of  interest  here.  Ignoring  the  scaling  terms  outside  the  integral, 
this  relationship  can  be  recognized  as  a  Fourier  transform  of  the  near-field  distribution  when 
the  frequency  variables  u  and  v  are  evaluated  at  u  =  and  v  =  jL. 
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3.3  Coherence  Theory 

The  basic  premise  of  coherence  theory  is  the  characterization  of  the  statistical  prop¬ 
erties  of  radiation.  In  the  most  general  form,  optical  coherence  is  defined  as  the  correlation 
between  the  field  U  at  a  point  P\  at  time  t  and  at  a  second  point  P2  at  time  t+r,  described 
by  the  mutual  coherence  function  Ti2  [24],  given  by 

r12(P1,P2,T)  =  E[U(P1,t)U*{P2,t+T)\  (3.22) 

From  this  general  expression,  temporal  and  spatial  coherence  effects  can  be  described  by 
setting  the  variables  appropriately.  For  example,  temporal  coherence  effects  can  be  described 
using  (3.22)  and  co-locating  the  observation  points  P±  and  P2. 

3.3.1  Mutual  Intensity.  The  most  significant  coherence  properties  for  the  active 
imaging  problem  are  the  spatial  coherence  properties  obtained  when  the  correlation  is  eval¬ 
uated  with  r  =  0.  In  this  case,  the  mutual  coherence  function  is  given  a  special  designation 
called  the  mutual  intensity  J12,  defined  by 


Ji2(Pi,P2)  =  r12(Pi,P2,0)  =E[U(P1,t)U*(P2,t)\  (3.23) 

If  P\  =  P2,  the  mutual  intensity  Ji2(P\ ,  P\ )  equals  the  average  intensity  in  the  observation 
plane  E  [/(Pi)]. 

3.3.2  Propagation  of  the  Mutual  Intensity.  The  specific  character  of  the  mutual 
intensity  function  for  light  reflected  from  a  distant  object  is  dependent  on  both  the  nature 
of  the  object  being  illuminated  and  the  character  of  the  illumination.  Typically,  the  mutual 
intensity  at  the  object  is  easily  described,  and  the  problem  becomes  finding  the  mutual 
intensity  in  the  observation  plane  given  the  mutual  intensity  at  the  object. 

For  convenience,  let  P\  =  (£1,771)  and  P2  =  {f,2,rj2)  be  points  in  the  object  plane,  and 
let  Q\  =  (ui,v±)  and  Q2  =  (u2,v2)  be  points  in  the  observation  plane.  Equation  (3.16)  can 
be  used  to  find  the  field  at  points  Q  \  and  Q 2  which  can  then  be  substituted  into  equation 
(3.22)  to  find  the  mutual  intensity.  Doing  this,  interchanging  the  order  of  integration  and 
expectation,  and  expressing  r  as  a  function  of  the  path  length  difference  from  the  source  to 
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points  Q2  and  Q 2,  results  in  an  expression  for  the  mutual  intensity  in  the  observation  plane 
given  by 


Jl2(Ql,  Q2)  — 


Ji2(Pi,  P2)  exp 


2n  . 

-Jy  {r2  -  ri) 


x(#i)  X(<?2) 

\r\  A  r2 


d^idr/id^2dr?2 


(3.24) 


where  n  is  the  distance  from  the  point  Qi  to  P\,  r2  is  the  distance  from  the  point  Q2 
to  the  point  P2,  and  9 1  and  02  are  the  angles  between  r\  and  r2  and  the  surface  normal 
respectively  [24], 


3-4  Speckle  Statistics 

The  speckled  appearance  of  coherent  light  scattered  from  almost  any  surface  is  a  well 
known  phenomenon.  It  was  recognized  early  on  that  speckle  was  due  to  interference  in 
the  detector  plane  [12].  When  coherent  light  reflects  from  a  rough  surface,  the  differences 
in  surface  profile  result  in  slight  differences  in  propagation  distances  as  shown  in  Fig.  3.4. 
These  path-length  differences  are  adequately  represented  mathematically  by  the  addition 
of  a  random  phase  to  the  wave  front.  Propagation  of  the  field  with  the  addition  of  this 
phase  results  in  the  random  interference  patterns  known  as  speckle.  To  understand  the 
implications  and  uses  of  speckle,  its  statistical  properties  need  to  be  understood. 


RMS  Roughness 
»  X 


Figure  3.4:  When  coherent  light  reflects  from  a  rough  surface,  the  slight  differences  in 

path-length  result  in  random  phase-front  variations  that  are  manifest  as  speckle. 


3-4-1  First-Order  Speckle  Statistics.  The  first  property  of  interest  for  laser  speckle 
is  the  statistical  distribution  of  the  intensity  at  a  point  in  the  observation  plane.  To  do  this, 
a  rough  object  is  defined  such  that  the  phase  induced  by  surface  roughness  upon  reflection 
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is  randomly  distributed  on  the  interval  ( — 7r,  7t]  .  Physically,  this  corresponds  to  an  object 
with  an  RMS  surface  roughness  of  one  wavelength  or  more,  a  condition  that  is  satisfied  for 
all  but  the  most  highly  polished  surfaces  at  optical  wavelengths.  When  the  assumption  of 
a  rough  object  is  valid,  the  field  in  an  observation  plane  some  distance  away  is  given  by  a 
superposition  of  contributions  from  the  randomly  phased  fields  in  the  object  plane.  In  this 
case,  the  problem  can  be  viewed  as  the  sum  of  complex  phasors,  which  in  the  limit  of  many 
scattering  locations  simplifies  to  complex  circular  Gaussian  statistics  given  by  [12] 

P(A’'’'4i)  =  2^eXP{-^2^‘}  (3'25) 

where 


Ar  =tRe{U}  (3.26a) 

Aj  =Im{U}  (3.26b) 


are  the  real  and  imaginary  parts  of  the  complex  field,  a2  is  given  by 

j2_E  [Iq] 


(3.27) 


and  I0  is  the  intensity  reflected  from  the  object. 

Applying  variable  transformations  to  get  the  distribution  in  terms  of  intensity  and 
phase,  and  integrating  out  the  phase  to  get  the  marginal  intensity  distribution  leads  to  an 
exponential  distribution  in  intensity  given  by 


P(!P) 


E [Ip]  eXP  (  E[7P]  )  -  0 

0,  else 


(3.28) 


where  Ip  is  the  intensity  in  the  far-held  [24] .  One  implication  of  this  distribution  is  that  the 
standard  deviation  of  the  intensity  is  equal  to  the  mean  E  \IP] . 


3-4-2  Second-Order  Speckle  Statistics.  An  understanding  of  second-order  speckle 
statistics  begins  with  a  description  of  the  mutual  intensity  at  the  object.  Assuming  that  the 
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correlation  length  of  the  surface  roughness  is  small  compared  with  the  size  of  the  object 
the  mutual  intensity  in  the  object  plane  can  be  described  by  [24] 


=  E  [Io  (Pi)]  s  (Pi  -  P2) 


(3.29) 


where  6  (Pi  —  P2)  is  the  Dirac  delta  in  Cartesian  coordinates.  If  the  object  is  uniformly 
illuminated,  E  [I0\  is  proportional  to  the  reflectivity  of  the  object. 

At  this  point  (3.24)  can  be  used  to  determine  the  mutual  intensity  in  the  observation 
plane.  Given  the  geometry  being  considered  and  the  form  of  the  mutual  intensity,  several 
simplifications  can  be  made.  In  the  far-held,  x(@o)  and  x(@i)  are  approximately  1,  and 
— E-  «  -4.  Additionally,  the  standard  paraxial  approximations  can  be  made  for  n  and  r2 
in  the  exponential, 


ri 


r-2 


\j z2  +  (ui  -  6)2  +  (vi  -  r/i)2 

,  (m  -  6)2  +  (vi  -  m)2 
z+  2z2 

\j z2  +  (U2  -  6)2  +  {V2  -  m)2 

,  {u2  -  6)2  +  (^2  -  m)2 
z+  2z2 


(3.30) 


Applying  these  approximations  and  employing  the  sifting  property  of  5 (Pi  —  P2)  results 
in  the  final  expression  for  the  mutual  intensity  in  the  observation  plane. 

J12  (A u,  Av)  =  2  J I  E[/(^,r/)]exp|-j^  [(Ait£  +  Au7/)]|d^dr/  (3.31) 

where  A u  =  rt2  —  u\,  Av  =  u2  —  v\,  il)p=  fz  [(«i  +  v2)  —  (u2  +  v f)] ,  and  7  is  a  scaling 
constant.  Although  derived  for  coherent  illumination  scattered  from  a  rough  object,  (3.31) 
is  functionally  identical  to  the  Van  Cittert-Zernike  theorem  for  incoherent  light  [24] .  At  this 
point  it’s  also  worth  noting  that  the  ensemble  averaged  intensity  in  the  far-held  is  given  by 


E  [ Ip(u ,  u)]  =  Ji2(Au  =  0,  Av  =  0) 
which  is  constant  across  the  observation  plane. 


(3.32) 
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One  characteristic  of  interest  is  the  pupil-plane  Power  Spectral  Density  (PSD)  of  the 
speckled  intensity.  According  to  the  Weiner-Khintchine  theorem,  the  PSD  and  autocorrela¬ 
tion  of  the  speckle  intensity  are  related  through  a  Fourier  transform  relationship  [46]  such 
that 

<I>pp(x,  y)  =  F{TPp{Au,  Au)}  (3.33) 

where  .+{•}  is  the  Fourier  transform,  ‘hpp  is  the  speckle  PSD,  and  rpp(Au,Au)  is  the 
speckle  autocorrelation  for  point  separations  A u  and  Av. 

Because  the  underlying  fields  obey  complex  Gaussian  statistics,  the  autocorrelation 
of  the  field  amplitude  and  the  intensity  are  related  through  the  complex  Gaussian  moment 
theorem  [24]  according  to 


Tpp  =E[Ip(ui,vi)]E[Ip(u2,v2)]  +  \Ji2(ui,v1,u2,v2)\' 


=  E  [IP]2  1  +  ]  A 


(3.34) 


where  the  complex  coherence  coefficient  yc  is  given  by 


Hc(ui,vi,u2,v2) 


_ +12  (ui,Vi,U2,V2) _ 

[+12  +12  (U2,V2,U2,V2)] 1/2 


(3.35) 


Recognizing  from  (3.31)  and  (3.35)  that  yc  is  a  scaled  Fourier  transform  of  the  object 
intensity,  the  normalized  near-field  intensity  PSD  <f >raj  can  be  written 


®nf  (u,  V )  =  \yc  (A u,  Au)|2 


(3.36) 


Substituting  (3.36)  into  (3.34)  and  applying  the  Weiner-Khintchine  theorem,  the  far- 
held  intensity  (speckle)  power  spectral  density  can  be  written  as 


<S>PP(x,y)  =  E[Ip}2 


5(x,y)  +  fi&nf  (u,v)} 


(3.37) 


Because  the  near  held  intensity  E  [Ip\  is  real  and  non-negative,  +7{<f>n/}  =  T  1{<I>n/})  which 
can  be  used  with  (3.33)  to  give 


4>pp(u,  v)  =  E  [Ip]2  [5(u,  v )  +  rn] 


(3.38) 
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where  Tn  is  the  normalized  autocorrelation  of  the  near-field  intensity  given  by 


Fn(x,  y )  =  jo(x,  y)  ©  o(x,  y) 


(3.39) 


and  where  ©  is  a  two-dimensional  correlation. 

Additional  statistical  properties  of  the  far-held  pattern  can  be  found  provided  the 
joint  probability  distribution  function  for  the  applicable  quantity  is  known.  Of  particular 
interest  are  the  joint  distributions  for  the  complex  amplitude,  intensity,  and  phase.  As 
stated  previously,  the  held  in  the  detector  plane  follows  joint  complex  circular  Gaussian 
statistics  [12],  resulting  in  a  probability  distribution  function  given  by 


P(U) 


(2tt)-/2^IV| 


exp 


--uHTju 


(3.40) 


where  U  is  a  vector  of  complex  amplitudes,  H  is  the  Hermitian  transpose,  and  IV  is  the 
associated  correlation  matrix  given  by  IV  =  E  [?7J7'ff]  [35].  From  this,  the  joint  statistics 
of  the  amplitude,  phase,  and  intensity  can  be  derived. 


To  specify  the  second-order  density  function  of  the  amplitude  and  phase,  the  complex 

(r)  .  (i) 

amplitudes  U\  and  U2  at  two  distinct  points  are  written  in  as  U\  =  U{  +  iU\  and  U2  = 
Urp  +  iufjp.  The  joint  PDF  of  U\  and  U2  is  then  given  by 


p(u[r\u?\ 


u£\u®) 


exp 

|f/l|2  +  |C/2|2-Mct/lt/2*- 

-i£urua 

2o-2(l-|//c|2) 

47T2<74 

1 - 1 

1 

"F 

O 

to 

(3.41) 


where  fic  is  the  complex  coherence  factor  from  (3.35),  and  it  has  been  assumed  that 
E[/p(ui,ui)]  =  E  [Ip(u2,v2)\  =  2 a2  [12]. 
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The  joint  statistics  of  the  intensity  and  phase  are  then  related  to  (3.41)  through  the 
variable  transformations 

=  ^/hcos(^l) 

=  v^cos(02) 

(3.42) 

U?  =  V^sin^) 

U P  =  \//2  sin(02) 


Noting  that  the  Jacobian  of  this  transformation  is  1/4,  and  defining  /i=  \/jlc\  and  i/j=  Z//c, 
the  resulting  joint  PDF  for  the  intensity  and  phase  is  given  by 


Pi,e{hih ,  #ii  O2) 


exp 


Ii+h-2^hl2ticos(0i-62+i(>) 

2<t2(1— fi'2) 

167r2(T4(l  —  [i2) 


(3.43) 


which  can  be  integrated  to  find  the  marginal  joint  densities  for  the  intensity  and  phase. 
From  [12] ,  the  joint  density  for  the  intensity  is  given  by 


Pi(h,h)  = 


exp 


h+h 


h 


E  [I]2  (1  —  P2)  VE[/](l-/i2) 


2nVhh 


(3.44) 


where  Jo  is  a  modified  Bessel  function  of  the  first  kind,  zero  order.  Also  from  [12],  the  joint 
density  of  the  phase  is  given  by 


Pe(9 1,^2)  = 


1-M2 


47r2(l  —  /32)3/2 


sin  1  P+^-  +  y7!  -  /32^j  (3.45) 


where  f3  =  /rcos(#2  —  9\  +  y 1),  and  6\  and  #2  are  in  the  interval  (— 7r,7r). 

This  chapter  introduced  the  basic  building  blocks  of  polarization,  scalar  diffraction, 
and  the  statistical  nature  of  light.  With  these  tools  in  hand,  the  next  step  is  to  use  these 
tools  to  build  up  a  system  model  for  use  in  algorithm  development. 
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IV.  Conventional  Imaging  and  Atmospheric  Turbulence 


With  an  understanding  of  scalar  diffraction  and  other  optical  phenomenon  in  hand, 
the  next  step  required  to  begin  developing  a  reconstruction  algorithm  is  forming  a 
model  for  the  imaging  system  that  includes  the  effects  of  aberrations  on  image  formation. 
Beginning  with  the  diffraction  theory  developed  in  chapter  III,  a  model  for  image  forma¬ 
tion  using  both  coherent  and  incoherent  illumination  is  developed.  The  effects  of  system 
aberrations  and  the  usage  of  Zernike  coefficients  for  describing  them  are  presented.  Because 
the  primary  aberrations  of  interest  are  caused  by  atmospheric  turbulence,  a  statistical  de¬ 
scription  of  the  turbulence  is  developed  and  again  described  in  terms  of  Zernike  modes.  A 
method  for  simulating  the  effects  of  turbulence  using  Zernike  modes  is  presented.  Finally, 
conventional  phase  diversity  techniques  for  reconstructing  images  from  aberrated  data  are 
presented  for  both  Gaussian  and  Poisson  dominated  noise  statistics. 


4-1  Basic  System  Model 

Before  diffraction  theory  can  be  used  to  describe  the  imaging  process,  a  model  for 
the  imaging  system  through  which  the  light  will  propagate  must  be  developed.  For  an 
achromatic  imaging  system,  even  complex  configurations  can  be  modeled  using  a  single  thin 
lens  located  in  the  pupil-plane  of  the  system  as  shown  in  Fig.  4.1  [8].  The  object  is  described 
by  the  field  U0(£,  rj)  in  a  plane  a  distance  z0  from  the  imaging  system  pupil,  the  effective 
lens  and  pupil-plane  field  Up(u ,  v)  are  located  in  the  pupil  plane  of  the  imaging  system,  and 
the  image-plane  field  Ui(x,  y )  is  in  a  plane  a  distance  Z{  behind  the  pupil.  Diffraction  theory 
as  outlined  in  chapter  III  is  used  to  translate  the  field  between  planes.  The  effect  of  the  lens 
can  be  described  by  multiplying  the  incident  field  with  a  real  pupil  function  and  complex 
phase  such  that  the  field  just  after  the  lens  is  given  by 

Up(u,  v )  =  Up(u,  v)P(u,  v)e~jMAu2W)  (4.1) 

where  fi  is  the  focal  length  of  the  effective  lens  and  P(u,  v )  is  the  transmission  function  of 
the  imaging  system’s  pupil.  Frequently,  P{u,  v)  is  a  clear  circular  pupil  described  by 
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Figure  4.1:  Simplified  imaging  system.  Imaging  systems  can  generaly  be  described  by  a 

simple  lens  with  a  diameter  equal  to  the  imaging  system  pupil  and  a  focal  length  equal  to 
the  focal  length  of  the  imaging  system. 


P(u,  v ) 


1,  u2  +  v2  <  1 

< 

0,  else 


(4.2) 


where  the  coordinate  system  has  been  scaled  such  that  the  pupil-diameter  is  one  to  simplify 
later  work. 


4-1.1  Coherent  Imaging.  Because  diffraction  is  a  linear  operation,  the  effect  of 
diffraction  through  a  linear  system  can  be  described  in  general  by  the  superposition  integral 


Ui(x,y) 


h(x,y;£,r])U0(£,r])d£,dri 


(4.3) 


where  the  impulse  response,  h(x,y,^,r/),  is  the  image-plane  field  that  would  result  from  a 
point  source  located  at  the  point  (£,77)  in  the  object  plane  [25].  Using  a  point  source  object 
to  obtain  h,  the  field  in  the  pupil  plane  is  simply  a  diverging  spherical  wave;  which  under 
the  paraxial  approximation  is  given  by 

Up(u,v)  =  ^-exj [(u  -  £)2  +  (v  -  rj)2]  |  (4.4) 
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Applying  the  lens  and  using  Fresnel  diffraction  to  propagate  the  pupil  plane  field  to  the 
image  plane,  the  resulting  impulse  response  for  the  imaging  system  is  given  by 


h{x,  y\  £,  rf)  = 


A2 


exp 


2o 

P(u, v ) exp 


exp 


•  71  (  2  2\ 

3—{x,y) 


Jt  (—  +  -  -  j  )  {u2  +  v2) 

A  Zi  fi  1 


exp  <  -j 


2tt 

T 


i  ,  v  <  y  i 

— i - u  + - 1 - \v 

Zo  Zi)  \Z0  Zi 


dudu  (4.5) 


Considering  the  quadratic  phase  term  within  the  integral,  when  the  Gauss  lens  law  is  sat¬ 
isfied,  i.e.  when 

1  1  1 

(4.6) 


i  _  i  1 

fl  Z0  Zi 


the  phase  term  vanishes,  leaving  only  the  phase  terms  outside  the  integral  to  deal  with.  For 
the  term  involving  image-plane  coordinates  x  and  y,  if  the  intensity  is  the  final  quantity  of 
interest  this  term  can  be  neglected  outright.  The  term  involving  the  object-plane  coordinates 
£  and  r]  isn’t  so  easily  neglected;  however,  this  term  can  also  be  approximated  as  unity  under 
certain  assumptions  that  will  be  invoked  here  [25]. 

To  further  simplify  the  impulse  response,  the  system  magnification  M  is  defined  as 


M  = - - 

Zn 


(4.7) 


and  the  scale  constants  outside  the  integral  are  dropped,  resulting  in 

h(x  —  £,y  —  rf)  =  JJ  P(u,  v)  exp  j^-(x  —  M£)  u  +  (y  —  Mrj)  dudv  (4.8) 

which  when  M  =  —1,  is  the  impulse  response  function  for  a  linear,  shift-invariant  system 
that  can  be  recognized  as  a  scaled  Fourier  transform  of  the  imaging  system  pupil-function. 
Applying  this  impulse-response  function  to  (4.3)  results  in  the  final  model  for  the  field  in 
the  image  plane 

f‘  POO 

Ui(x,y)=  f70  (£,  r?) /?,  (x  -  £,  y  -  ??)  d£d??  (4.9) 

J  J  —  OO 
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The  instantaneous  intensity,  /*,  in  the  image  plane  is  then  given  by 


h{x,y)  = 


Uo  (£,  y)  h(x-£,y-  rj)  d£dr/ 


(4.10) 


Because  all  detectors  integrate  for  some  period  of  time,  the  instantaneous  intensity  must  be 
averaged  [24],  resulting  in 

f‘  POO  P  poo 

E  [Ii(x,y)\=  //  d^idr/i  //  d&dr]2h(x-t1,y-r)2)h*(x-&,y-y2)'E[Uo{£i,yi)UZ(&,r]2)] 

J  J  —  oo  J  J  —  oo 

(4.11) 

For  coherent  illumination,  the  fields  are  deterministic  and  the  expectation  can  be  dropped. 
The  integrals  can  then  be  separated  such  that  the  final  expression  for  the  coherent  image 
reverts  to  the  instantaneous  intensity  given  by  (4.10). 


4-1.2  Incoherent  Imaging.  When  the  illumination  is  incoherent,  the  mutual  inten¬ 
sity  is  adequately  described  by 


e  [Uo(€i,m)u0(&,  m)}  =  <K£i  -  6,  yi  -  mVo^yi)  (4.12) 


where  I0(^i,rji)  is  the  intensity  in  the  object  plane.  Substituting  this  into  (4.11)  and  em¬ 
ploying  the  sifting  property  of  the  5-function  results  in  an  expression  for  the  intensity  in  the 
image  plane  under  incoherent  illumination  given  by 


h(x,y ) 


£,y-  y)io(£,y)d£,dr] 


(4.13) 


where  the  point-spread  function  s  is  given  by 


s{x,y)  =  \h{x,y)\2 


(4.14) 


4-1.3  Aberrations  and  Zernike  Polynomials.  In  general,  the  imaging  system  isn’t 
completely  described  by  a  perfect  thin  lens,  and  consequently  suffers  from  some  form  of 
aberration  which  serves  to  widen  the  point-spread  function  and  blur  the  resulting  image. 
The  simplest  approach  to  describing  the  effects  of  aberrations  is  to  add  their  contributions 
to  the  system  model’s  pupil  function.  The  pupil  function  is  allowed  to  be  complex  and 
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defined  as 

P'(u,v)  =  P{u,v)ej^u’v)  (4.15) 

where  any  obstructions  or  amplitude  transmission  variations  are  included  in  P(u,v),  and 
phase  aberrations  such  as  defocus,  coma,  astigmatism,  etc...  are  included  in  the  phase  term 

To  simplify  analysis,  the  phase  aberration  cf>(u,  v)  is  frequently  projected  against  a  basis 
set  to  parameterize  the  aberrations.  Because  they  coincide  with  the  classical  aberrations  and 
tend  to  efficiently  represent  the  aberration  with  relatively  few  terms,  Zernike  polynomials 
(modes)  are  often  used  as  the  basis  set.  For  this  work,  the  Zernike  modes  are  as  defined 
and  indexed  by  Noll  [37]  such  that 


Zj(r,9)  =  < 


Vn  +  lR®{r) 


where 


n  —  m 
2 


jcw  =  £ 


(— 1  )s(n  —  s)! 


m/0 

and 

j  even 

m  0 

and 

j  odd 

771.  =  0 

0! 

n— 2s 

(4.16) 


-  *  ! 


(4.17) 


s=0  L  2  “J '  !  2 

r  =  Vu2  +  v2,  6  =  tan-1  (v/u),  and  O<r<l,O<0<  2n.  The  sequence  of  modes  defined 
by  the  mode  index  j  is  such  that  incrementing  j  increments  the  angular  mode  number  m 
until  it’s  equal  to  the  radial  mode  number  n,  at  which  point  n  is  incremented  and  m  reset 
to  zero.  The  progression  in  mode  number  for  modes  1  through  16  is  shown  in  Table  4.1 
along  with  the  corresponding  common  name  (where  applicable)  for  the  particular  aberration 
represented  by  a  given  mode. 


4-2  Atmospheric  Turbulence 

Quite  frequently  the  limiting  system  aberration  is  the  blurring  caused  by  atmospheric 
turbulence.  This  effect  can  be  easily  seen  in  the  “twinkling”  of  stars  and  results  in  much 
worse  resolution  than  would  be  the  case  if  the  atmosphere  were  not  a  contributing  factor. 
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Table  4.1:  First  16  Noll-indexed  Zernike  modes  with  the  associated  common  aberration 

names  [41] 


3 

n 

m 

Zj(r,9) 

aberration 

1 

0 

0 

1 

piston 

2 

1 

1 

2r  cos (9) 

tilt  (x) 

3 

1 

1 

2r  sin  ( 9 ) 

tilt  (y) 

4 

2 

0 

y/3  (2 r2  -  l) 

defocus 

5 

2 

2 

y/6r2  sin  (29) 

astigmatism  (x) 

6 

2 

2 

a/6 r2  cos  (29) 

astigmatism  (y) 

7 

3 

1 

a/8  (3r3  —  2r)  sin  (9) 

coma  (x) 

8 

3 

1 

a/8  (3r3  —  2 r)  cos  (9) 

coma  (y) 

9 

3 

3 

a/8 r3  sin  (39) 

10 

3 

3 

a/8 r3  cos  (39) 

11 

4 

0 

a/5  (6r4  -  6r2  +  l) 

3rd  orcier  spherical 

12 

4 

2 

y/W  (4r4  —  3r2)  cos  (29) 

13 

4 

2 

\/l0  (4r4  —  3r2)  sin  (29) 

14 

4 

4 

y/lO r4 cos (49) 

15 

4 

4 

a/10 r4 cos (49) 

16 

5 

1 

a/12  (I0r5  -  12r3  +  3r)  cos  (9) 

4-2.1  Kolmogorov  Turbulence  Theory.  A  complete  description  of  turbulence  theory 
would  require  an  entire  book,  and  multiple  sources  are  available  for  more  detailed  descrip¬ 
tions  (see  for  example  [2,41]).  Only  a  brief  overview  of  the  relevant  results  will  be  presented 
here. 


Using  dimensional  analysis  and  physical  insight,  Kolmogorov  developed  a  power  spec¬ 
trum  for  velocity  fluctuations  in  a  turbulent  medium  that  was  extended  to  describe  the 
power  spectrum  for  index  of  refraction  variations,  4>n,  given  by  [2] 

111  1 

$n(«)  =  0.033C£k-t,  —  <k<-  (4.18) 

J^O  l o 

where  k=  27t/\/ u 2  +  v2  is  the  spatial  wavenumber,  La  is  the  turbulence  outer  scale  which 
corresponds  roughly  to  the  size  of  the  largest  turbulent  area  that  can  be  analyzed  indepen¬ 
dently  from  the  parent  flow,  lQ  is  the  turbulence  inner  scale  which  is  roughly  the  size  of  the 
smallest  intact  regions  of  turbulent  flow,  and  C2  is  the  index  of  refraction  structure  function 
constant  characterizing  turbulence  strength.  In  general,  C2  will  be  a  function  of  altitude, 
but  a  path-averaged  value  will  be  used  here. 
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To  transition  from  index  of  refraction  fluctuations  to  phase  fluctuations,  the  aberration 
phase  due  to  propagating  a  distance  L  along  the  z  axis  through  the  turbulent  flow  is  found 
by  adding  up  contributions  from  the  varying  optical  path  lengths  caused  by  the  changes  in 
refractive  index  according  to 

fL 

4>(u,v)  =  k  /  dzhi(u,v,z)  (4.19) 

Jo 

where  L  is  the  thickness  of  the  turbulent  layer,  the  index  of  refraction  for  the  flow  is  defined 
as 

h  (u,  v,  z)  =  no  +  n\  (u,  v,  z)  (4.20) 

ho  is  the  mean  index  value,  hi  is  the  small  perturbation  caused  by  turbulence,  and  L  is  the 
distance  along  the  propagation  path.  Assuming  phase  fluctuations  are  sufficiently  small  to 
approximate  their  cumulative  effect  in  a  single  layer,  the  field  resulting  from  propagating  a 
unit-amplitude  incident  plane  wave  through  the  turbulence  would  then  be  given  by 

Ut  (u,  v )  =  exp  \j(f>  (u,  u)]  (4.21) 

Assuming  wide-sense  stationarity  for  the  turbulence,  the  spatial  correlation  function  for  the 
resulting  field  is  then  given  by 


T^Ait,  Av)  =  E  [exp  [ j(j)(u ,  u)]  exp  [— j(j){u  —  A u,  v  —  Au)]] 


=  E 


exp 


jk  (  j  dz  [hi(u,  v,  z)  —  hi(it  —  Ait,  v  —  Av,  z)\ 


(4.22) 

(4.23) 


Assuming  the  index  fluctuations  follow  Gaussian  statistics,  (4.22)  can  be  simplified 


using 


where 


E  [exp  (a  +  jb)\  =  exp  [{^a  +  jfib)\  exp 


O  iaa  ~  fipOaOb  ~  Ob) 


P  = 


E  [(a  —  Pa){b  —  Pb)\ 


(4.24) 


(4.25) 


and  where  /jLa  and  fi &  are  the  mean  values  of  a  and  b  respectively.  Applying  (4.24)  to  (4.22) 
and  recognizing  that  the  real  part  of  the  exponent,  a,  is  zero  everywhere,  /Ua  =  <ra  =  0  [41]. 
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The  imaginary  part,  b,  of  (4.24)  is  seen  to  be  only  a  function  of  hi,  which  from  the  definition 
of  (4.20)  is  zero  mean.  Applying  this,  p, b  =  0,  and  the  resulting  phase  correlation  function, 
I>,  is  given  by 


T^Au,  Av)  =exp 


=  exp 


1 

2 

1 

2 


L 

dz  [hi  (it,  v,  z )  —  h\(u 
D^Au,  Av) 


A u,  v 


(4.26) 

(4.27) 


where  the  phase  structure  function  D^Au,  Av)  is  defined 


D^Au,  Av)  =k2  E 


dz  [hi  (it,  v ,  z)  —  hi(u  —  Au,  v  —  Av,  z)\ 


=  E 


(4>(u,  v)  —  (j){u  —  Au,  v  —  Av))' 


(4.28) 

(4.29) 


At  this  point  it’s  clear  that  describing  the  structure  function  is  sufficient  to  completely 
characterize  the  statistics  of  the  phase. 

Defining 

r  =  \J  Au2  +  Av 2  (4.30) 

and  assuming  Kolmogorov  statistics,  the  phase  structure  function  is  given  by  [41] 


Dlj)(Au,  Av)  =  2.91k2 LC2rs 


Defining  the  Fried  parameter  ro  [41] 


0.185 


47T2 


3 

5 


the  structure  function  reduces  to 


D^Au,  v)  =  6.88 


(4.31) 


(4.32) 


(4.33) 


4- 2. 2  Representing  and  Simulating  Turbulence  with  Zernike  Polynomials.  As  with 
other  aberrations,  it’s  frequently  convenient  to  describe  turbulence  effects  in  terms  of  Zernike 
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modes.  Using  Kolmogorov  statistics  as  given  above,  Noll  wrote  the  power  spectral  density 
for  phase  fluctuations  as  [37] 


$^(k)  =  0.023rQ/3fv-11/3 


(4.34) 


Using  this  spectrum,  Noll  showed  that  the  Zernike-mode  expansion  coefficients  were  Gaus¬ 
sian  distributed  with  zero  mean  and  a  covariance  matrix,  Tz,  defined  by 


(r* 


'3,3 


-0.145 


7 r 


5 

\J{n  +  1  )(n'  +  1)(— <5„ 


n+n'  —  2m  f°°  _  5  Jn^.\  (27T/u)  (27T/l) 

')  2  akk  3  — 


(4.35) 

where  D  is  the  diameter  of  the  imaging  system  pupil,  is  the  coefficient  for  the  jth  mode,  n 
and  m  are  the  radial  and  angular  mode  numbers  for  the  jth  mode,  and  5mrni  is  the  Kronecker 
delta.  The  integral  in  (4.35)  can  be  analytically  evaluated,  and  is  given  by  [37] 


—  5  Jn+i(2irk)Jni+i(2nk) 
3  p 


r(¥)r 

(n+n')  23 

2  6 

214/3p 

(n-n')  .  17 

2  “r  6 

r 

(n+n')  23 

2  “l"  6 

(4.36) 


One  notable  property  of  (4.35)  is  that  the  only  dependence  on  the  atmosphere  or 
imaging  system  is  contained  in  the  D/r o  term,  which  can  be  factored  out  as  a  scale  constant. 
The  covariance  matrix  can  then  be  computed  for  D/r o  =  1  and  used  for  arbitrary  turbulence 
strengths  by  simply  scaling  the  result  by  (D/ro)5^3 .  The  normalized  covariance  matrix,  for 
modes  2-10  is  given  in  Table  4.2. 

Table  4.2:  Normalized  Zernike  mode  covariance  for  the  first  10  modes  excluding  piston 

and  assuming  Kolmogorov  statistics.  To  obtain  the  true  covariance,  multiply  these  values 
by  (£>/r0)5/3. _ 


j 

2 

3 

4 

5 

6 

7 

8 

9 

10 

2 

0.448 

0 

0 

0 

0 

0 

-0.0141 

0 

0 

3 

0 

0.448 

0 

0 

0 

-0.0141 

0 

0 

0 

4 

0 

0 

0.0232 

0 

0 

0 

0 

0 

0 

5 

0 

0 

0 

0.0232 

0 

0 

0 

0 

0 

6 

0 

0 

0 

0 

0.0232 

0 

0 

0 

0 

7 

0 

-0.0141 

0 

0 

0 

0.0062 

0 

0 

0 

8 

-0.0141 

0 

0 

0 

0 

0 

0.0062 

0 

0 

9 

0 

0 

0 

0 

0 

0 

0 

0.0062 

0 

10 

0 

0 

0 

0 

0 

0 

0 

0 

0.0062 

One  application  of  these  results  is  the  generation  of  simulated  phase  screens  using 
Zernike  modes.  To  build  a  random  phase  screen  with  Kolmogorov  statistics,  a  vector  of 
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Gaussian  distributed  random  numbers  characterized  by  the  covariance  of  (4.35)  is  generated 
and  used  to  weight  the  Zernike  modes  which  are  then  summed. 

The  correlated  Gaussian  random  numbers  can  be  generated  by  utilizing  the  fact  that 
any  linear  transformation  of  Gaussian  random  numbers  is  also  Gaussian  distributed  [35]. 
Furthermore,  given  the  joint  Gaussian  random  vector  x  ~  A/"(0,S),  the  transformation 
matrix  M,  and  defining  y  =  Mr,  the  transformed  vector  y  will  be  described  by 

y  ~  A7(0,  MSM^)  (4.37) 

If  x  ~  A7(0, 1),  as  would  be  the  case  for  computer-generated  random  numbers,  MSMi;  = 
MMh.  Because  the  correlation  matrix  is  symmetric  and  positive  definite,  a  vector  of 
random  numbers  with  the  desired  correlation  T  can  be  generated  by  factoring  the  correlation 
matrix  into  its  Cholesky  decomposition  such  that 

T  =  CHC  (4.38) 

then  generating  a  vector  of  uncorrelated  random  numbers  and  performing  the  vector-matrix 
product 

y  =  CHx  (4.39) 

at  which  point  y  has  the  desired  statistics. 

To  simplify  computation,  a  reduced  set  of  Zernike  modes  may  be  used,  sacrificing 
some  of  the  high-frequency  content.  While  this  is  an  incomplete  representation,  the  results 
shown  in  Table  4.2  clearly  show  that  the  parameter  variance,  and  thereby  the  “power”  in  a 
given  mode,  decreases  rapidly  with  increasing  order.  Because  of  the  ability  of  the  Zernike 
mode  basis  set  to  efficiently  pack  the  majority  of  the  aberration  into  the  lowest  order  terms, 
a  truncated  parameter  vector  can  still  accurately  represent  the  aberration  phase. 

4-3  Phase  Diversity  Imaging 

One  successful  approach  to  producing  high-resolution  images  in  the  presence  of  atmo¬ 
spheric  and  systemic  aberrations  is  to  use  phase  diversity  (PD)  techniques  to  post-process 
aberrated  data.  First  proposed  by  Gonsalves  as  a  means  for  phase  estimation  [22,23],  phase 
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diversity  imaging  uses  information  contained  in  both  a  conventional  and  an  intentionally 
aberrated  image  of  the  object,  both  of  which  have  been  degraded  by  the  same  unknown 
aberration.  Before  modifying  this  approach  to  accommodate  the  active  imaging  scenario, 
an  understanding  of  the  basic  technique  is  required.  Once  the  basic  algorithm  is  understood, 
it  can  be  extended  to  work  with  active  imaging  data. 

A  conventional  phase  diversity  imaging  system  is  composed  of  a  traditional  imaging 
system  (in  this  case  a  telescope  focused  at  infinity)  with  the  addition  of  a  second  detector 
plane  which  has  been  subjected  to  some  known  aberration.  Because  of  easy  implementation 
and  mathematical  simplicity,  system  developers  have  traditionally  chosen  defocus  for  the 
known  aberration.  A  notional  phase-diversity  imaging  system  is  illustrated  in  Fig.  4.3.  The 
following  development  of  conventional  phase  diveristy  imaging  under  Gaussian  and  Poisson 
noise  statistics  is  derived  from  that  of  Paxman  et  al.  [39] . 


Figure  4.2:  A  conventional  phase  diversity  imaging  system.  Data  is  collected  in  the  focal 
plane  and  in  a  plane  with  some  known  amount  of  defocus  added. 

4-3.1  System  Model.  The  initial  step  in  developing  a  phase  diversity  algorithm  is 
to  model  the  noiseless  imaging  system.  Consistent  with  section  4.1.2  the  system  is  modeled 
in  terms  of  its  optical  transfer  functions 

Hpp(u,v)  =P(u,  v)  exp  [— j(j)(u,  v)]  (4.40a) 

Hdp(u,  v )  =P(u,  v)  exp  [-j  v )  +  4>d(u,  u))]  (4.40b) 

where  P(u,  v )  is  the  amplitude  of  the  imaging  system’s  pupil-function,  4>(u,  v )  is  the  unknown 
aberration  phase  induced  by  the  optical  system  or  the  atmosphere,  and  4>d(u,  v )  is  the  known 
diversity  phase.  To  simplify  analysis,  the  aberrating  phase  and  diversity  phase  are  written 
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as  an  expansion  in  some  suitable  basis  set  such  that 


cj)(u,  v)  =  ^  akipk{u,  v )  (4.41) 

k 

and 

(j)D(u,  v )  =  y  Mk{u,  v )  (4.42) 

k 

where  and  /?*,  are  scalar  weights,  and  i/jk  is  the  kth  basis  function. 

Because  of  their  ability  to  describe  complex  aberrations  with  relatively  few  basis  func¬ 
tions,  the  Zernike  modes  as  described  in  section  4.1.3  are  used  as  the  basis  set.  Selecting 
defocus  for  the  diversity  aberration,  the  diversity  phase  is  completely  described  by  the  fourth 
Zernike  mode  such  that 

(j>D{u,  v )  =  /34V’4 {u,  v )  (4.43) 

Once  the  system  model  is  complete,  the  PSF’s  are  given  by 

sm(x,y)  =  \hm(x,y)\2  (4.44) 

where  m  £  { FP ,  DP}  and  the  impulse  response  is  defined  by 

hm(x,  y)  =  P{Hm(u,  u)}  (4.45) 

The  incoherent  images,  gm ,  that  would  result  in  the  absence  of  noise  are  then  given  by 

9m(x,y)  =  o(x,y)  ®  sm(x,y)  (4.46) 

where  o  is  the  idealized  image  that  would  be  formed  under  an  aberration-free  geometrical 
optics  approximation  and  (8)  represents  a  two-dimensional  convolution. 

At  this  point  in  the  development,  it’s  necessary  to  statistically  describe  the  dominant 
noise  source;  then,  using  the  noise  statistics,  develop  an  appropriate  estimator.  In  the 
simplest  cases  such  as  when  detector  thermal  noise  is  the  dominant  source,  the  noise  is 
described  as  an  additive  white  Gaussian  process.  Alternatively,  for  low-noise  detectors,  the 
dominant  noise  source  would  more  likely  be  photon  or  shot  noise  characterized  by  Poisson 
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statistics.  Each  noise  source  leads  to  a  different  estimator,  both  of  which  will  be  described 
below. 


4-3.2  Additive  Gaussian  Noise  Limited  Detection.  When  the  noise  is  adequately 
described  using  additive  white  Gaussian  statistics,  the  data  dm  is  modeled  by 


dm(x,  y )  =gm (x,  y)  +  nm{x,  y) 


(4.47a) 


=o  (x,  y)  <g>  sm(x,  y)  +  hm(x,  y) 


(4.47b) 


where  nm{x,  y)  is  spatially  uncorrelated  Gaussian  noise  for  the  mth  channel.  The  probability 
distribution  for  the  data  is  then  given  by 


w(p) = nn 

m  x,y 


i 


exp 


( dm{x,y )  -gm(x,y)Y 

2(7? 


(4.48) 


where  V  is  a  complete  data  set  containing  both  focal-plane  and  diversity  plane  intensity 
measurements.  From  (4.48)  the  log-likelihood  function  is  then 


4o,  *>--££  [dm(x,y)  -  o(x,y)  <g>  sm(x,y)}2  (4.49) 

m  x,y 

where  <3=  {oq . . .  a &}  is  the  vector  of  Zernike  expansion  coefficients  for  the  aberration  phase. 
Applying  both  the  convolution  theorem  and  Parseval’s  theorem,  (4.49)  can  be  written 


L(o,  a)  =  -  ^  v)  _  °('u’  v)Sm{u ,  u)]2  (4.50) 

m  u,v 

where  Dm ,  O,  and  Sm  are  Fourier  transforms  of  dm .  o,  and  sm  respectively.  The  maximum 
likelihood  estimate  for  a  and  o  is  then  found  by  maximizing  (4.50).  This  is  equivalent  to 
minimizing  the  mean  squared  error  metric  used  by  Gonsalves  [23] .  Furthermore,  Gonsalves 
showed  that  the  dependence  on  o  could  be  removed  by  forming  a  closed-form  estimate  for 
the  object  given  the  data  and  aberration  defined  by 


O  M(u,v)  = 


Dfp{u ,  v)S*FP(u ,  v)  +  Dppju,  v)S*DP(u ,  v) 
\SFP(u,v)\2  +  \SDP(u,v)\2 


(4.51) 
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which  is  essentially  an  application  of  a  Weiner  filter  to  the  available  data. 

At  this  point,  (4.51)  can  be  substituted  back  into  (4.50)  resulting  in  a  new  objective 
function  that  only  depends  on  the  aberration  parameters,  greatly  reducing  the  space  over 
which  maximization  must  be  done.  The  reduced  objective  function  is  given  by 


LM{a) 


E 


\DFp(u,v)SFp(u,v)  -  DDp(u,v)Sdp(u,v)\2 
\SFp{u,v)\2  +  |  SDP(u,v)\2 


(4.52) 


Paxrnan  et  al.  [39]  showed  that  an  equivalent  objective  function  requiring  fewer  com¬ 
putations  could  be  written 


LM{a) 


E 

(u,v)eCi 


T,mDm(u,v)S^(u,v)\2 

J2m\Sm(u,v)\2 


Y  Y \Dm{u,v)\2 

(■ u,v)eC2  m 


(4.53) 


where  the  space  (  containing  (u,  v )  has  been  partitioned  into  (  =  {Ci  U  C2}  where  C 2  contains 
all  points  where  Sm(u,  v)  =  0  for  all  values  of  m  G  {FP,  DP}.  Either  (4.52)  or  (4.53)  is  then 
maximized  over  the  space  of  aberration  parameters  to  determine  the  unknown  aberration. 
The  aberration  is  then  used  to  generate  the  PSFs  which  are  then  substituted  into  (4.51)  to 
generate  a  reconstructed  image. 

The  maximization  of  (4.52)  or  (4.53)  can  be  done  using  any  appropriate  optimization 
routine.  However,  most  efficient  nonlinear  optimization  methods,  such  as  quasi-Newton’s 
methods,  require  the  computation  or  estimation  of  gradients.  Paxrnan  et  al.  [39]  showed 
that  the  required  gradients  were  given  by 


dL 


•M 


dai 


-4  Y  & k V^Im  1  X!  Hm(U’  ®  Hrn)  v )  \  (4-54) 


where  Im  {•}  is  the  imaginary  part  of  the  argument  and 


Zm(u,v )  =  < 


Eil^l 


(4.55) 


0, 


(u,v)  e  Co 


The  dependence  on  (u,v)  in  (4.55)  is  implied,  but  for  brevity  has  not  been  written  out. 
Because  tractable  analytic  expressions  for  the  gradients  are  available,  standard  optimization 
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methods  such  as  quasi-Newton’s  methods  can  be  used  to  determine  the  maximum  likelihood 
estimates  for  a.  and  the  associated  reconstructed  image  o. 


4-3.3  Poisson  Noise  Limited  Detection.  While  additive  Gaussian  noise  sources 
lead  to  tractable  estimates,  in  several  operating  regimes  of  interest  the  dominant  noise 
source  is  photon  noise  characterized  by  Poisson  statistics.  Assuming  that  the  noise  process 
is  spatially  independent,  the  probability  distribution  for  a  data  set  is  given  by 


Pv(V)  =  J  j  ]  J 


m  x,y 


gm{x ,  y)d^x^  exp  [~gm(x,  y)\ 
dm(x,y)\ 


(4.56) 


where  as  with  the  Gaussian  noise  case,  T>  is  the  data  set  containing  both  FP  and  DP  data. 
Using  (4.56),  the  log- likelihood  function  for  the  object  and  aberration  is  then  given  by 


[dm(x,y)lngm(x,y)  -  gm(x,y)] 

m  x,y 


(4.57) 


where  terms  that  don’t  depend  on  o  or  a  have  been  dropped  for  convenience.  Considering 
the  second  term  in  (4.57) 


Y  Y  9m<^x'  =YYY  °(£>  -^y-v) 

m  x,y  £,r) 


m  x,y 


(4.58a) 

(4.58b) 


;  Y  Y  °  ^  Y  Sm(x 

m  £,r)  X,y 

Y  Y  °(f ’  Y  \H™(U> v )  exP  {-j27T  i(x  -  0u  +  (y-  y)^}} I2  (4.58c) 

m  £,77  u,v 

i2 


Y 0  (£> Y  Y  «)r 


m  u,v 


Y°^'^YYp2^v^ 

^,77  m  u,v 


(4.58d) 

(4.58e) 


where  Parseval’s  theorem  and  the  Fourier  shift  theorem  have  been  used.  Note  that  the 
double  sum  over  the  pupil  function  P(u,  v )  is  neither  a  function  of  the  object  o  nor  the 
aberration  a.  Letting 

f-EE  P2(u,v)  (4.59) 
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(4.58)  becomes 


Y  Y  gm(^x'  y)=^Y°  (x>  y) 


(4.60) 


m  x,y  x,y 


x,y 


and  (4.57)  simplifies  to 


[dm(x,  y )  In  gm(x,  j/)]-7^°  (®>  y) 


(4.61) 


Removing  explicit  dependence  on  o  from  (4.61),  as  was  done  for  additive  Gaussian 
noise,  requires  that  the  object  that  maximizes  (4.61)  for  a  given  aberration  a  be  found 
and  substituted  back  into  (4.61).  However,  a  closed  form  solution  to  that  problem  has  not 
been  found.  The  approach  then  is  to  attempt  to  maximize  the  objective  over  the  complete 
space  of  pixel  intensities  and  aberration  parameters.  With  the  expanded  parameter  space 
over  which  to  maximize,  utilization  of  an  efficient  large-scale  optimization  routine  becomes 
even  more  important.  To  this  end,  computation  of  the  gradients  with  respect  to  both  the 
aberrations  and  the  object  pixels  is  the  next  logical  step. 

Differentiating  (4.61)  with  respect  to  o (x0,y0) 


(4.62a) 


(4.62b) 


To  compute  the  partial  derivative  of  gm,  (4.13)  and  the  definition  of  discrete  convolution 
are  applied  to  give 


(4.63a) 


=sm(x  -x0,y-  ya) 


(4.63b) 


Substituting  this  into  (4.62)  gives  the  final  form  of  the  partial  derivative  of  L  with  respect 
to  o  (x0,y0) 
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Differentiating  (4.61)  with  respect  to  aberration  parameter  a k  results  in  an  expression 


similar  to  (4.62)  with  the  exceptions  that  the  constant  term  7  is  dropped,  and  y  ^  is 


replaced  by  ^ 


dL 

doth 


EE 

m  x,y 


dm(x,y)  dgm(x,y) 
9m(x,y)  dak 


(4.65) 


The  partial  derivative  of  gm(x,y )  with  respect  to  ak  is  then  found  by  using  (4.13)  and  the 
definition  of  discrete  convolution,  resulting  in 


dgm(x,y ) 
dak 


U,V 


dsm(x  —  u,y  —  v) 
dak 


(4.66) 


Using  the  definition  of  sm  from  (4.44)  the  partial  derivative  of  sm(x,  y)  with  respect  to  ak 
can  be  written 


dsm(x,y) 

dak 


h*m(x,y) 


dhm(x,y) 

dak 


+  hm(x,y ) 


dh*m{x,y) 

dak 


(4.67) 


Applying  the  definitions  from  (4.40)  and  (4.45)  to  the  partial  derivative  of  hm(x,y)  and 
interchanging  the  order  of  integration  and  differentiation  results  in 


dhm(x,y ) 
dak 


=  T\ 


|  v)P(u,  v )  exp  [j  ( 'pmip4:(u ,  v)  +  ^2  anipn{u,  v)')  j  j  (4.68) 


where  /3fp  =  0  and  (3 dp  defines  the  defocus  diversity  phase.  The  partial  derivative  of  sm 
with  respect  to  ak  is  then 


dsm(x,y) 

dak 


=s'm(x,y) 


y) jrfkiu,  v)P(u,  v)  exp  j  v 


+  y ^anijjn(u,v 

n 


(4.69a) 

(4.69b) 
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where  %e{-}  is  the  real  part  of  the  argument.  Substituting  (4.69b)  back  into  (4.66)  and 
(4.65)  results  in  the  final  form  for  the  derivative  of  L  with  respect  to  ak 


dL 

dak 


-£ 

m,x,y 


dm  (x,  y ) 

9m  (x, y ) 


'^2°(£,v)s'(x -£,y  -  rj) 


-£ 

m,rr,y 


rfm  (-T  y) 
9m  (x, y ) 


[o(x,y)  <S)  s'(x,y)] 


(4.70a) 

(4.70b) 


At  this  point  the  objective  function  and  gradients  can  be  used  with  any  applicable  nonlinear 
optimization  routine  to  simultaneously  develop  an  estimate  of  the  object  and  aberration 
parameters. 


4-3-4  Multi-Frame  Phase  Diversity  Reconstruction.  Seldin  and  Paxrnan  expanded 
phase-diversity  image  reconstruction  for  Poisson  statistics  to  allow  for  multiple  data  frames 
where  each  frame  has  been  corrupted  by  a  different  atmospheric  aberration  [45].  This  re¬ 
sulted  in  a  hybrid  between  multi-frame  blind  deconvolution  and  conventional  phase  diversity 
reconstruction  that  allowed  for  better  handling  of  noisy  data  such  as  that  encountered  at 
low  light  levels. 

Defining  the  aberration  set  a  as 

a  =  {a1,  a2, . . . ,  aF]  (4-71) 

where  is  a  vector  describing  the  aberration  for  the  fth  frame,  the  point  spread  functions 
are  given  by 

resulting  in 

gfm{xiy)  =  °(x,y)  ®  sL(x>y)  (4-73) 


T <  P(u,  v )  exp 


-j  /3mip4(u,v)  +  j2^fkMu,v) 


(4.72) 
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where  m  E  {FP,  DP}.  Assuming  statistical  independence  between  data  frames,  the  joint 
probability  distribution  for  a  set  of  F  frames  of  phase  diversity  data  is  then  given  by 


pv(P)  =  nnn 

f  m  x,y 


(4.74) 


where  /  E  {1....F}  and  the  data  set  V  contains  F  sets  of  phase-diversity  data.  Other 
than  expanding  the  search  space  to  include  additional  aberration  parameters,  the  further 
development  of  this  formulation  is  identical  to  that  presented  in  section  4.3.3  resulting  in 
an  objective  function  given  by 


l(°,  «)  =  d^x' ln _  9^x' y ) 


x,y 


(4.75) 


where  the  /  superscript  indexes  the  data  frames.  The  model  images  gm{x,  y )  are  determined 
as  before,  using  the  aberration  parameters  for  the  current  data  frame.  The  gradients  are 
identical  to  those  of  (4.64)  and  (4.70)  for  any  given  value  of  /. 

Seldin  and  Paxrnan  [45]  observed  that  the  algorithm  tended  to  converge  to  local  max¬ 
ima  that  did  not  represent  the  known  object.  In  an  effort  to  better  condition  the  problem, 
they  incorporated  a  “sieve”  to  enforce  some  degree  of  smoothness  on  the  reconstruction.  In 
its  basic  form,  the  sieve  is  a  convolution  kernel  used  to  help  eliminate  local  maxima  in  the 
objective  function.  As  implemented,  the  object  was  defined  by 


°  {x,  y)  =  ^2  7^v(x  (4-76) 

tv 

where  v(x,  y)  was  an  otherwise  unspecified  two-dimensional  Gaussian  convolution  kernel. 
At  this  point,  gm{x,y)  is  then  defined  as 

3m(z,  =  °'(&  v)v(x  -£,y-  v)sL(u  -€,v-v)  (4-77) 

u,v 

The  underlying  o'  is  estimated  using  the  algorithm,  then  used  in  (4.76)  to  reconstruct  the 
image.  Similar  techniques  have  also  been  suggested  to  better  condition  single-frame  phase 
diversity  [22]. 
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When  the  unknown  aberration  is  due  to  atmospheric  turbulence,  the  known  statistics 
of  the  atmosphere  can  be  introduced  into  the  problem  to  help  further  condition  the  prob¬ 
lem.  Assuming  Kolmogorov  statistics  for  the  atmospheric  aberration  and  a  Zernike-mode 
expansion,  the  covariance  of  the  Zernike-mode  turbulence  coefficients  is  known  [37]  and  can 
be  added  as  a  regularization  term  to  the  objective  function  of  (4.75). 

Thelen  et  al.  [48]  proposed  a  framework  for  incorporating  the  prior  knowledge  about 
the  aberration  into  the  objective  function.  Noting  that  the  aberration  parameters  are  cor¬ 
related  Gaussian  random  variates,  the  joint  probability  distribution  function  for  a  given  set 
of  N  aberrations  is  given  by 

Ps'(S,)  =  n/(2^)«  |r,| exp  (4  (,s'Tr'1<s/)  <478) 

where  T  is  a  transpose,  a f  is  a  vector  composed  of  the  Noll-ordered  aberration  parameters 
for  the  fth  frame,  and  |TZ|  is  the  determinant  of  the  covariance  matrix  Tz.  This  probability 
distribution  may  then  be  converted  into  a  log-likelihood  function  given  by 


=  <4-79) 
/ 

which  can  then  be  incorporated  into  (4.75)  resulting  in  a  modified  objective  function  for  the 
multi-frame  case  given  by 


T'(° ,a)  =  \dfm(x,y)lngf,1(x,y)  -  gfm(x,y) 


x,y 


(4.80) 


The  only  additional  information  now  required  before  passing  (4.80)  to  a  gradient-type  opti¬ 
mization  routine  is  the  gradient  of  (4.79).  Given  that  (4.79)  does  not  depend  on  the  object 
o,  that  portion  of  the  gradient  is  clearly  zero.  For  the  partial  derivatives  with  respect  to  to 
individual  aberration  parameter  an,  Thelen  et  al.  showed  that 


£^  =  -(^“1  <481> 

i.e.  the  nth  element  of  the  vector  that  results  from  multiplication  of  the  correlation  matrix 
inverse  with  the  fth  aberration  vector. 
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At  this  point,  the  tools  required  for  building  a  system  model  are  complete,  and  all  of 
the  pieces  are  in  place  to  begin  development  of  a  statistical  image  reconstruction  algorithm. 
The  only  remaining  step  is  to  combine  the  estimation  and  optimization  methods  given  in 
Chapter  II  with  the  system  models  developed  in  Chapters  III  and  IV. 
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V.  Active  Phase  Diversity  Imaging 


One  critical  failure  of  conventional  imaging  techniques  is  their  reliance  on  natural 
illumination.  There  exist  many  scenarios  of  interest  where  the  object  to  be  imaged 
is  in  the  earth’s  shadow  and  cannot  be  imaged  using  conventional  methods.  An  approach 
to  overcoming  this  shortcoming  is  to  use  lasers  to  actively  illuminate  the  target,  in  effect 
implementing  a  “flash  bulb”  for  the  imaging  system.  However,  the  unique  properties  of 
laser  light  introduce  both  complications  and  possibilities  that  are  not  present  with  natural 
or  incoherent  light.  With  the  understanding  of  estimation  methods,  diffraction,  imaging, 
and  statistical  properties  of  coherent  light  developed  in  chapters  II  through  IV,  it’s  possible 
to  begin  development  of  an  image  reconstruction  algorithm  appropriate  for  an  active  imaging 
system. 

Several  techniques  have  been  proposed  over  the  years  to  utilize  coherent  illumination 
to  reconstruct  an  image  including  imaging  correlography  [17,44,50],  sheared-beam  inter¬ 
ferometry  [50],  root  reconstructors  [31],  reference- wave  heterodyne  detection  [6,7],  analytic 
methods  [28],  iterative  statistical  estimation/deconvolution  methods  [30,43],  optimization 
methods  [34],  and  Fourier-telescopy  [3,29,32,50].  The  configuration  being  considered  is  an 
active  PD  imaging  (APDI)  system  as  shown  in  Fig.  5.1  and  consists  of  a  conventional  PD 
imaging  system  with  a  laser  illuminator  and  an  additional  sensor  to  record  PP-intensity. 
The  following  sections  detail  the  algorithmic  modifications  required  to  expand  PD  imaging 
as  described  in  Chapter  IV  to  the  case  of  active  illumination. 


Figure  5.1:  An  active  PD  imaging  system  consists  of  a  conventional  PD  system  with  an 
additional  sensor  to  record  PP  intensity  and  a  laser  illuminator 
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5.1  Noise  Model 


The  first,  and  possibly  most  significant  difficulty  in  working  with  coherent  illumination 
is  the  introduction  of  speckle  noise  as  described  in  chapter  III.  Because  nearly  all  surfaces 
are  rough  on  the  scale  of  an  optical  wavelength,  the  wave  reflected  from  the  object  will  have 
a  spatially  random  phase,  that  when  propagated  will  cause  the  various  field  components  to 
interfere  and  produce  the  grainy  speckle  patterns  characteristic  of  reflected  laser  light.  As 
shown  previously,  speckle  noise  follows  exponential  statistics  such  that  for  a  given  pixel 


Pi(I) 


E[/] 


exp 


I 

E[7]. 


(5.1) 


where  it  has  been  assumed  that  I  >  0. 

One  of  the  biggest  consequences  of  laser  speckle  is  its  extremely  noisy  nature  as  demon¬ 
strated  by  the  fact  that  the  mean  and  standard  deviation  are  equal.  Defining  the  signal-to- 
noise  ratio  (SNR)  as  the  ratio  of  the  mean  signal  with  the  noise  standard  deviation,  speckle 
noise  maintains  a  SNR  of  1  regardless  of  the  signal  strength.  Unlike  photon  or  detector  noise 
sources,  SNR  doesn’t  improve  with  higher  signal  levels.  As  a  result  of  the  noisy  nature  of 
active  illumination,  a  multi-frame  approach  will  be  required  to  help  mitigate  noise  effects. 

While  any  one  pixel  in  the  image  plane  follows  the  statistics  of  (5.1),  the  joint  statistics 
of  the  detected  intensity  are  required  to  develop  a  reconstruction  algorithm.  Given  that 
the  complex  field  is  Gaussian  distributed  in  all  planes,  one  could  ideally  perform  variable 
transformations  on  the  joint-Gaussian  distribution  as  was  done  for  (3.40),  and  integrate  out 
all  phase  dependence.  However,  beyond  the  second-order  distribution  derived  in  Chapter 
II  the  applicable  integrations  and  transformations  become  intractable.  As  a  result,  an 
approximate  form  for  the  joint-distribution  must  be  developed. 

The  simplest  approximation  is  to  assume  statistical  independence  between  pixels,  and 
form  the  joint  distribution  by  multiplying  the  marginals.  For  this  simplification  to  be  valid, 
the  correlation  between  data  points  must  be  negligibly  small.  In  the  active  imaging  case, 
the  required  correlation  can  be  found  by  using  the  mutual  intensity  in  accordance  with  the 
complex  Gaussian  moment  theorem  of  (3.34).  As  shown  in  chapter  III,  the  mutual  intensity 
in  the  PP  of  the  imaging  system  can  be  described  by  a  scaled  Fourier  transform  of  the 
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object.  This  mutual  intensity  can  then  be  propagated  to  the  FP  and  DP  of  the  imaging 
system  using  the  more  general  (3.24).  However,  the  required  integrals  become  prohibitively 
difficult.  Fortunately,  it  was  shown  by  Zernike  that  in  nearly  all  cases  of  interest,  the  light 
in  the  PP  of  a  coherent  imaging  system  could  be  approximated  as  a  constant  intensity 
incoherent  source  [24].  Under  this  assumption,  the  Van  Cittert-Zernike  theorem  can  be 
used  to  determine  the  FP  and  DP  mutual  intensities. 

Substituting  the  modulus  squared  of  the  imaging  system’s  pupil-function  in  for  the 
intensity  in  (3.31)  and  normalizing 

r  poo 

U,  Aw)  =  \P(x,y)\2  exp 

J  J  —  OO 

where  nc(Au,  Av)  is  the  complex  correlation  coefficient  for  points  separated  by  (An,  Av). 
For  a  clear  circular  aperture,  this  correlation  is  given  by  the  scaled  point-spread  function 
of  the  un-aberrated  imaging  system  where  the  spatial  variables  (x,  y )  have  been  replaced 
by  (Ax,  Ay).  Assuming  a  clear  circular  pupil,  the  modulus  of  the  mutual  intensity  falls  off 
rapidly  with  increasing  point  separation  as  shown  in  Fig.  5.2.  Assuming  that  the  pixel  spac¬ 
ing  of  the  detector  is  small  compared  with  the  width  of  the  PSF,  measured  pixel  intensities 
are  hereafter  assumed  statistically  independent. 

One  major  failure  of  the  assumptions  used  to  generate  Fig.  5.2  is  that  they  provide 
no  insight  into  the  average  detected  intensities.  An  alternative  approach  for  finding  these 
quantities  stems  from  the  description  of  the  mutual  intensity  in  the  object  plane  given 
by  (3.29).  This  description  is  identical  to  that  of  an  incoherent  source,  and  the  effect  of 
averaging  across  many  speckle  realizations  will  be  equivalent  to  that  obtained  for  incoherent 
illumination.  Because  of  this,  the  modeled  noiseless  images  representing  the  mean  of  the 
statistical  distributions  are  described  by  the  incoherent  image  formation  model  of  (4.13). 

An  additional  consideration  is  the  temporal  correlation  between  speckle  realizations. 
In  the  absence  of  any  relative  motion  between  the  object,  illumination  source,  and  imaging 
system;  the  phase  relationships  that  give  rise  to  speckle  will  remain  constant,  and  speckle 
observed  in  the  observation  planes  will  be  stationary.  However,  even  small  relative  changes 
on  the  order  of  a  wavelength  are  sufficient  to  significantly  change  the  speckle  pattern.  For 
this  work  it’s  assumed  that  the  time  over  which  speckle  realizations  become  de-correlated 


27T  j 

-j  —  (A ux  +  Avy)  >  dxdy 

J 


(5.2) 
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Figure  5.2:  Approximate  normalized  correlation  magnitude  as  a  function  of  point  sep¬ 

aration  for  a  1  m  F10  telescope  operating  at  1/rm.  The  correlation  falls  off  rapidly  with 
increasing  point  separation. 

is  small  compared  with  the  sampling  time  of  the  imaging  system.  Therefore,  temporal 
independence  of  speckle  realizations  is  assumed. 

Finally,  to  form  the  approximate  joint  probability  distribution,  independence  between 
observation  planes  is  assumed.  Though  not  rigorously  justified,  this  assumption  is  necessary 
to  form  a  tractable  reconstruction  algorithm.  Combining  these  simplifications  and  assump¬ 
tions  to  form  a  joint  distribution  similar  to  (4.56),  the  joint  probability  distribution  for  a 
multi-frame  active  PD  data  set  is  given  by 


d(n(x,  y) 


(5.3) 


where  gfn(x,y )  is  the  ensemble  average  intensity  in  the  FP  or  DP  for  the  fth  frame  and  is 
as  given  in  (4.13). 
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5.2  Static  Aberrations 


The  first  reconstruction  case  considered  is  when  the  image  is  blurred  by  an  aberration 
that  isn’t  changing  across  data  frames.  This  could  represent  a  system  that  suffers  from 
an  unknown  aberration  due  to  manufacturing  defects  or  misalignment,  or  a  system  with  a 
slowly  varying  aberration. 


5.2.1  Basic  Objective  Function.  For  a  static  aberration,  the  model  blurred  image 
gm(x,  y )  is  the  same  across  all  data  frames,  resulting  in  a  slightly  simplified  probability 
distribution  given  by 


Pd(V) 


n 

f,m,x,y 


1 

9m{x,y) 


exp 


dfn(x,  y) 
9m{x,y ) 


(5.4) 


Re-casting  this  as  a  likelihood  equation  for  the  object  and  aberration,  the  log-likelihood  is 
given  by 


L{o,  a)  =  —  ^2  I  ln  (Imix,  y) 

f,m,x,y  \ 


dm(x,  y) 

9m(x,y ) 


(5.5) 


Because  only  the  data  is  frame-dependent,  the  summation  over  /  can  be  distributed  such 
that  for  F  frames  of  data 


L(o,a) 


£ 

m,x,y 


^F  In  gm(x,y)  + 


1 

9m(x,y ) 


^2dLix,y) 

f 


(5.6) 


5.2.2  Pupil  Plane  Data  Regularization.  The  noisy  nature  of  laser  speckle  makes 
regularization  particularly  important.  However,  the  speckle  can  also  provide  additional 
information  that  can  be  used  for  regularization.  Consider  the  object  information  encoded 
in  the  speckle  pattern  observed  in  the  PP  of  the  imaging  system  as  described  in  section 
3.4.2.  Restating  (3.38),  the  normalized  PP  speckle  PSD  can  be  described  by 


$pp(x,  y)  =  d(x,  y)  +  jo(x,  y)  ©  o(x,  y) 


(5.7) 


where  the  frequency  variables  of  the  PSD  have  been  replaced  with  (x,  y)  for  convenience 
in  later  calculations,  <hpp(x,y)  is  the  ensemble  averaged  PP-PSD,  and  ©  represents  a  two- 
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dimensional  autocorrelation.  Consequently,  a  large  amount  of  information  about  the  object 
is  contained  in  the  PP  intensity  which  can  be  used  to  help  regularize  the  problem. 

Before  this  information  can  be  used,  a  statistical  model  for  the  PSD  must  be  formed. 
As  with  all  data  from  an  actively  illuminated  system,  the  PP  data  will  suffer  from  speckle 
noise.  Recall  that  the  intensity  in  the  PP  is  exponentially  distributed  with  a  constant  mean 
across  the  aperture.  Using  the  definition  for  the  PSD  as 

$pp(x,y)  =  \F{Ipp(£,ri)}\2  (5-8) 

where  Ipp  is  the  normalized  intensity  in  the  PP,  we  can  consider  the  effect  of  speckle 
statistics  on  the  PSD. 

Writing  the  Fourier  transform  in  its  integral  form, 

p  poo 

T{IPP(x ,  y)}=  /  /  Ipp(x,  y)  exp  [-j  ( ux  +  vy)]  dxdy  (5.9) 

J  J  —  OO 

it  can  be  interpreted  as  an  infinite  sum  of  unit-magnitude  complex  numbers  weighted  by 
exponentially  distributed  scalars.  If  the  Ipp(x,y )  pixels  are  independent,  by  the  central 
limit  theorem  the  resulting  real  and  imaginary  components  of  the  Fourier  transform  will 
have  Gaussian  statistics.  Though  Ipp  isn’t  strictly  spatially  independent,  a  histographic 
analysis  of  simulated  data  shows  that  the  transformed  intensity  closely  follows  Gaussian 
statistics.  To  get  the  distribution  of  the  PSD,  the  same  variable  transformations  that  were 
used  in  (3.28)  are  applied  consistent  with  (5.8),  resulting  in  an  exponential  distribution  for 
any  given  point  in  the  PSD. 

In  order  to  form  a  tractable  joint  distribution,  spatial  independence  must  again  be 
assumed.  According  to  diffraction  theory,  propagation  of  light  to  the  FP  of  an  imaging 
system  is  mathematically  equivalent  to  a  Fourier  transform.  Using  this  insight,  the  same 
arguments  that  justified  spatial  independence  for  the  FP  and  DP  again  apply.  The  mutual 
intensity  in  the  now  mathematically  defined  FP  is  proportional  to  the  Fourier  transform  of 
the  imaging  system’s  pupil,  and  consequently  falls  off  rapidly  for  small  point  separations. 
Statistical  independence  is  again  assumed,  and  the  joint  probability  distribution  for  the 
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PP-PSD  is  then  given  by 


P>!>pp(^pp) 


n 


1 

^ - exp 

$pp(x,y) 


$pp(x,y) 

&pp(x,y) 


(5.10) 


To  negate  any  numerical  difficulties  introduced  by  the  5(x,y)  in  (5.7),  it’s  subtracted 
out  of  the  PSD  during  computation.  While  doing  so  causes  the  PSD  at  the  origin  to  no 
longer  follow  the  predicted  statistics,  this  effect  is  ignored  for  simplicity.  The  probability 
distribution  for  the  modified  speckle  PSD  is  then  given  by 


Pdpp(dpp)  = 


z,  y 


gpp(x,y) 


exp 


dpp(x,  y) 
gpp{x,y ) 


(5.11) 


where  the  PP  data  for  the  fth  frame  is  defined  as 


Z 

dfpp(x,y)  =  F{Ipp{€,,T})}  -5(x,y) 


(5.12) 


and  the  model  data  is  defined  as 


gpp{x ,  y)  =$pP(x,  y )  -  S(x,  y) 


(5.13a) 


=o(x,y)®o(x,y) 


(5.13b) 


It  should  be  noted  that  in  this  case  the  model  data  doesn’t  depend  on  the  aberration, 
resulting  in  a  lack  of  dependence  on  either  /  or  a.  Additionally,  gpp  is  invariant  to  rotation 
of  o  through  180°. 


At  this  point,  the  log-likelihood  function  for  the  object  given  a  PP  data  set  can  be 
written 


App(o) 


E 

f,x,y 


In  gPP(x,y) 


dpP(x,y ) 
gpp{x,y ) 


(5.14) 


This  likelihood  function  can  then  be  added  to  the  likelihood  function  of  (5.5)  to  produce 
the  regularized  objective  function  for  image  reconstruction 


T(o  ,5) 
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In  {gm(x,y))  + 


dL{x,y) 
gm{x,y ) 


(5.15) 
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where  now  m  E  {FP,  DP,  PP}  to  include  the  PP  regularization  term. 


5.2.3  Optimization  Methodology.  As  with  the  Poisson- noise  limited  case,  this 
objective  function  doesn’t  lend  itself  to  a  reduction  in  parameter  space  as  the  Gaussian 
noise  model  did.  As  a  result,  large-scale  optimization  methods  must  be  employed  to  si¬ 
multaneously  solve  for  both  the  object  and  aberration  parameters.  Recognizing  that  the 
most  efficient  optimization  methods  tend  to  require  gradient  information,  the  first  step  to¬ 
wards  optimization  is  gradient  derivation.  For  the  objective  function  of  (5.15),  the  partial 
derivative  of  L(o,  a)  with  respect  to  any  parameter  x  will  be  given  by 

d£j{o,  (r)  v 

dx  , ^ 

f,m,x,y 

The  task  then  remains  to  compute  the  partial  derivative  of  gm(x,  y)  with  respect  to  x- 

Because  the  model  images  for  both  the  FP  and  DP  are  the  same  as  those  for  conven¬ 
tional  PD  imaging,  the  partial  derivatives  of  gpp  and  gpp  with  respect  to  o (x0,y0)  and  an 
that  were  previously  derived  in  (4.63)  and  (4.66)  remain  valid.  They  are  re-stated  here  as 

(5-17) 

and 

dg"dak  ^  =  0(yX ’ V ^  ®  V ^  (5-18) 
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and  m  E  {FP,  DP}. 

Because  gpp  doesn’t  depend  on  a  its  partial  derivatives  with  respect  to  a k  are  zero. 
The  final  piece  required  for  the  gradient  is  the  partial  derivative  of  gpp  with  respect  to 
° (x0,y0).  The  autocorrelation  in  (5.13)  is  written  as 


gpp(x,  y)  =  °(£>  7?)°(x  +  i,y  +  g) 

(,,v 


(5.20) 
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differentiating  (5.20)  with  respect  to  o(x0,y0 )  interchanging  the  order  of  differentiation  and 
summation,  and  applying  the  product  rule,  the  partial  derivative  of  gpp(x,y )  with  respect 
to  o(xc,  ya)  is  given  by 


dgpp{x,y) 

— — - -  =  o(x0  -x,y0-y)+  o(x0  +  x,ya  +  y) 

do  (x0,y0) 


(5.21) 


One  final  consideration  before  an  optimization  routine  can  be  used  to  determine  the 
reconstructed  object  and  the  aberration  parameters  stems  from  the  form  of  both  (5.15)  and 
(5.16).  In  both  of  these  expressions,  the  objective  function  becomes  undefined  any  time 
gm(x,  y)  goes  to  zero.  In  this  case,  the  reconstruction  becomes  ill-conditioned,  and  the 
numerical  optimization  routine  will  likely  fail  to  converge  to  a  satisfactory  solution. 

One  possible  solution,  as  suggested  for  Poisson  noise  limited  PD  reconstruction,  is 
the  partitioning  of  the  objective  function  into  two  discrete  sets  (  =  {Ci  U  C2}  where  Co 
contains  all  points  where  gm(x,  y)  =  0  for  all  values  of  m.  All  contributions  to  the  objective 
function  from  any  point  contained  in  Co  are  then  thrown  out.  However,  this  approach 
doesn’t  adequately  deal  with  the  case  when  the  data  supports  a  non-zero  value  but  the 
current  guess  object  predicts  a  zero.  In  this  case  however,  the  gradient  should  indicate  a 
more  optimal  solution  with  a  non-zero  value  but  would  instead  indicate  a  zero  slope  and  an 
optimal  answer.  Furthermore,  numerical  roundoff  and  associated  instability  begin  to  have 
significant  negative  impacts  when  gm(x,y )  and  dm(x,y)  — *  0,  which  isn’t  adequately  dealt 
with  by  partitioning  the  solution  space. 

As  an  alternative,  a  better  conditioned  homotopic  problem  [33]  is  formed  by  adding  a 
bias  b  to  both  the  data  and  the  model  such  that 


L'(o,a) 


E 


ln(gm(x,  y)  +  b)  + 


dm(x,y)  +  b 
9m  (x,  y)  +  b 


(5.22) 
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The  homotopic  problem  converges  uniformly  to  the  original  problem  as  b  — >  0.  The  added 
bias  can  be  viewed  as  an  artificial  detection  dark-current  or  background  illumination.  The 
modified  objective  function  is  then  passed  to  the  numerical  routine  to  obtain  reconstructed 
images.  The  optimal  value  for  b  is  explored  in  Chapter  VI. 

5.3  Dynamic  Aberrations 

To  allow  for  a  much  broader  range  of  operational  scenarios,  the  unknown  aberration 
must  be  allowed  to  vary.  For  the  cases  of  interest,  the  aberration  evolves  with  the  changing 
atmosphere,  blurring  each  data  frame  differently.  The  changing  aberrations  correspond  with 
a  new  set  of  aberration  parameters  for  each  frame  that  must  be  included  in  the  objective 
function.  To  deal  with  this,  an  expanded  aberration  set  a  is  defined  as  in  (4.71),  such  that 
for  F  data  frames 

a  =  [a1,  a2, . . .  aF]  (5-24) 

where  a?  is  the  aberration  vector  for  the  fth  data  frame. 

5.3.1  Basic  Objective  Function.  The  only  fundamental  change  to  the  likelihood 
function  of  (5.15)  required  to  extend  the  problem  to  include  dynamic  aberrations  is  to 
modify  the  model  data  gm(x,y)  such  that  it  depends  on  the  frame  number  /  as  in  (4.73). 
With  this  change,  the  basic  objective  function  is  given  by 

L(o,ol)  =  - 

where  m  G  {FP,  DP}  and  all  other  terms  are  as  previously  described. 

5.3.2  Pupil  Plane  Data  Regularization.  Because  the  PP  data  is  insensitive  to 
the  phase  aberration,  the  PP  regularization  term  of  (5.14)  is  added  to  the  basic  objective 
function  of  (5.25)  without  modification.  Incorporating  this  into  the  basic  objective  of  (5.25) 
results  in  the  PP-regularized  objective  function 
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where  now  m  E  {FP,  DP,  PP}. 


Without  further  changes,  this  objective  function  could  be  passed  to  the  solving  routine 
to  produce  reconstructed  images.  However,  additional  information  becomes  available  which 
can  be  incorporated  into  the  algorithm  when  multiple  aberration  realizations  are  allowed. 

5.3.3  Phase  Aberration  Regularization.  Kolmogorov  theory  for  turbulence  pre¬ 
dicts  a  well-defined  power  spectral  density  for  phase  fluctuations  caused  by  atmospheric 
turbulence,  parameterized  by  a  measurable  quantity  ro  [2].  With  multiple  realizations  of 
the  induced  phase  and  information  about  ro,  we  can  begin  to  form  an  estimate  for  this 
PSD  and  penalize  the  objective  function  for  realizations  that  deviate  from  the  predicted 
standard.  However,  given  that  the  aberration  is  parameterized  by  Zernike-mode  expansion 
coefficients,  a  more  efficient  method  is  to  directly  relate  the  PSD  to  the  expansion  coeffi¬ 
cients.  Because  the  nature  of  the  turbulence  is  not  changed  by  the  use  of  active  illumination, 
Thelen’s  regularization  term  of  (4.79)  can  be  used  without  modification. 

In  addition  to  the  inter-mode  correlation,  the  temporal  evolution  of  the  aberration 
may  be  similarly  useful.  Under  Taylor’s  frozen  flow  hypothesis  [41],  the  internal  structure 
of  turbulence  changes  slowly  compared  with  the  velocity  at  which  it  passes  by  the  field 
of  view.  With  this  hypothesis,  and  knowledge  about  the  velocity  and  direction  of  the 
wind  moving  the  turbulence,  the  temporal  behavior  of  the  expansion  coefficients  can  be 
described.  Roddier  et  al.  [40]  developed  a  description  of  the  temporal  correlation  of  the 
Zernike  expansion  coefficients  under  Taylor’s  hypothesis.  Ten  Brummelaar  slightly  modified 
Roddier’s  results  [10].  Using  ten  Brummelaar’s  formulation,  temporal  correlations  for  the 
first  15  modes  excluding  piston  and  tip/tilt  were  computed  and  are  shown  in  Fig.  5.3  for 
a  1  m  aperture,  10cm  ro,  and  a  10 m/s  x-axis  oriented  wind  velocity.  The  correlations  drop 
rapidly  with  increasing  separation  and  mode  order. 

At  this  point,  a  regularization  term  similar  to  (4.79)  could  be  applied  to  the  objec¬ 
tive  function  by  building  the  temporal  correlation  matrices  for  each  aberration  parameter. 
However,  there  are  several  shortfalls  of  this  approach  that  make  it  less  attractive.  First, 
the  rapid  drop  in  the  correlation  with  small  time  separations  would  make  the  regularization 
term  particularly  susceptible  to  noise  and  other  error  sources  for  systems  with  a  moderately 
slow  frame-rate.  Second,  and  more  significantly,  this  approach  requires  that  the  turbulent 
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Temporal  Correlation,  v^lOm/s  D=1m  rQ=0.1m 


Figure  5.3:  Predicted  temporal  correlations  for  the  first  15  Zernike  modes  excluding  piston 
and  tip/tilt  assuming  a  1  m  aperture,  10cm  ro,  and  10 m/s  x-oriented  wind  velocity.  Note 
that  the  correlation  falls  off  rapidly  with  increasing  time  separation  and  mode-order. 

flow  be  adequately  described  by  a  single  layer  of  turbulence  moving  with  an  a-priori  known 
speed  and  direction.  While  the  effects  of  turbulence  on  a  single  frame  can  be  adequately 
described  by  a  single  turbulence  layer  in  the  PP,  realistic  turbulence  is  a  volume  effect. 
Given  that  wind  velocities  and  directions  vary  with  altitude,  describing  the  effective  motion 
of  the  turbulence  using  a  single  layer  becomes  significantly  difficult.  As  evidence  of  the 
problems  that  can  arise  when  predicting  temporal  statistics,  ten  Brummelaar  encountered 
difficulty  attempting  to  fit  his  derived  temporal  power  spectra  to  experimental  data  [9] .  The 
failure  of  predicted  and  measured  power  spectra  to  match  indicates  that  any  regularization 
attempting  to  use  temporal  statistics  could  potentially  have  a  significant  negative  impact 
on  image  reconstruction.  As  a  result,  no  information  regarding  the  temporal  evolution  of 
the  atmospheric  aberration  is  used  during  reconstruction. 

5.3.4  Optimization  Methodology.  Applying  both  the  PP-data  regularization  term 
of  (5.14)  and  the  phase  regularization  term  of  (4.79),  the  regularized  objective  function  for 
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dynamic  aberrations  is  given  by 
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(5.27) 


where  as  before  m  G  {FP,  DP,  PP}.  Once  again,  an  efficient  large-scale  solution  method 
will  be  required  to  obtain  the  desired  result.  Aside  from  the  introduction  of  the  time-varying 
aberation,  the  first  term  in  (5.27)  is  identical  (5.15),  and  as  a  result  the  gradients  will  be 
essentially  the  same.  The  only  impact  of  the  dynamic  aberration  is  to  require  computing 
the  gradient  of  gm{x,y)  with  respect  to  the  various  unknowns  for  each  frame. 

As  was  the  case  with  a  static  aberration,  the  objective  function  is  ill-behaved  for  points 
when  the  model  object  approaches  zero.  As  a  result,  the  same  bias  term,  b,  that  was  used 
previously  is  again  introduced  to  better  condition  the  objective.  The  resulting  modified 
objective  function  is  given  by 


Mo,  a)  =  - 
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f,m,x,y 


ln  (aUx,  y)  +  b)  +  +  b 

v  7  9m{x,y)  +  b 


+  hE(5/)  r*l5/  (5-28) 


The  modified  gradient  of  the  first  term  in  (5.28)  is  the  same  as  that  given  for  static-aberration 
case.  Because  the  second  term  in  (5.28)  doesn’t  include  b,  its  gradient  is  as  given  in  (4.81). 
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VI.  APDI  Results 


With  the  APDI  algorithm  complete,  the  next  logical  step  is  implementation  and 
performance  characterization.  Because  the  derived  algorithm  doesn’t  have  closed 
form  estimates  for  the  object  and  aberration,  an  analytical  characterization  was  impractical. 
Consequently,  a  Monte  Carlo  approach  was  used  to  characterize  algorithm  performance.  The 
estimator  was  implemented  using  the  L-BFGS-B  optimization  package  [11]  with  a  modified 
convergence  tolerance. 

6.1  Chapter  Overview 

Based  on  a  limited  set  of  initial  simulations,  optimal  conditioning  bias  and  convergence 
tolerance  appeared  mutually  independent,  and  independence  was  assumed  throughout.  A 
near-optimal  convergence  tolerance  was  selected  based  on  preliminary  simulations  and  used 
with  a  set  of  100  data  realizations  to  determine  the  optimal  bias  under  both  static  and 
dynamic  aberration  conditions.  Optimal  biases  for  the  static  and  dynamic  aberration  cases 
were  found  to  be  b  =  10%  and  b  =  50%  of  the  average  intensity  respectively. 

Once  the  optimal  bias  was  established,  the  optimal  convergence  tolerance  was  estab¬ 
lished  under  a  variety  of  conditions  including  SNR,  source  object,  array  size,  and  number 
of  data  frames.  The  optimal  convergence  tolerance  was  found  to  be  essentially  insensitive 
to  SNR,  source  object,  and  the  number  of  data  frames.  However,  the  optimal  tolerance  was 
found  to  be  strongly  dependent  on  the  size  of  the  detector  arrays.  Optimal  tolerance  values 
for  the  static  and  dynamic  aberration  cases  for  the  baseline  64  x  64  configuration  were  found 
to  be  7 toi  <  6  x  ICC8  and  7 toi  =  2  x  10“7  respectively. 

After  determining  approximately  optimal  system  parameters,  algorithm  performance 
for  SNR  values  in  the  range  2  <  SNR  <  20  with  10,  20,  30,  40,  and  50-frame  data  sets  for 
both  Gaussian  and  Poisson  dominated  detection  noise  was  evaluated.  The  algorithm  proved 
to  be  insensitive  to  detection  noise  for  SNR  >  7,  and  still  performed  relatively  well  for  SNR 
values  as  low  as  2.  The  selection  of  either  Gaussian  or  Poisson  dominated  noise  sources  made 
little  difference  in  overall  performance.  As  expected,  adding  additional  frames  to  the  data 
set  resulted  in  better  reconstructions.  For  the  static  aberration  case,  only  slight  gains  were 
seen  for  more  than  20  frames  as  the  noise  was  averaged  out.  For  the  dynamic  aberration 
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case,  as  additional  frames  were  added  the  image  improved  rapidly,  but  the  improvement 
with  additional  frames  began  to  taper  off  for  more  than  approximately  30  frames.  While 
the  improvement  tapered  off,  the  increase  in  computational  burden  climbed  nearly  linearly 
with  the  introduction  of  additional  frames. 

To  quantify  the  impact  of  PP  data  on  algorithm  performance,  variants  of  the  APDI 
algorithm  that  excluded  either  the  PP  or  DP  data  were  compiled.  The  PP-FP  only  and 
DP-FP  only  variants  were  then  run  along  with  the  original  APDI  algorithm  on  a  set  of  100 
simulated  data  realizations.  Performance  of  the  PP-FP  only  and  DP-FP  only  configurations 
were  then  compared  with  the  performance  of  the  full  APDI  algorithm.  As  expected, 
the  complete  APDI  algorithm  resulted  in  the  best  reconstructions.  Reconstructions  done 
without  PP  data  included  resulted  in  54%  and  148%  increases  in  average  MSE  compared 
with  the  Full  APDI  algorithm  for  the  static  and  dynamic  aberration  cases  respectively.  This 
shows  that  the  PP  data  has  a  dramatic  impact  on  overall  performance  of  the  reconstruction. 
Exclusion  of  the  DP  data  from  the  APDI  algorithm  resulted  in  only  a  11%  and  24%  increase 
in  MSE  for  the  static  and  dynamic  aberration  cases  respectively. 

Finally,  the  impact  of  the  aberration  regularization  term  developed  by  Thelen  et  al.  [48] 
was  determined  by  compiling  a  version  of  the  APDI  algorithm  with  the  regularization  term 
disabled.  Both  the  aberration-regularized  and  unregularized  algorithms  were  then  run  on  a 
collection  of  100  simulation  data  sets.  The  results  showed  that  addition  of  the  regularization 
term  resulted  in  neither  a  better  reconstruction  nor  a  lower  computational  burden. 

This  chapter  is  laid  out  as  follows:  section  6.2  describes  algorithm  implementation, 
section  6.3  outlines  data  simulation  techniques  that  were  used,  section  6.4  defines  the  perfor¬ 
mance  metrics  used  for  analysis,  section  6.5  describes  results  from  including  the  aberration 
regularization,  section  6.6  shows  the  effect  of  the  conditioning  bias  on  the  algorithm,  sec¬ 
tion  6.7  investigates  the  effect  of  system  parameters  on  convergence  tolerance,  section  6.8 
shows  the  benefits  and  costs  incurred  by  including  additional  Zernike  modes  in  the  aber¬ 
ration  model,  section  6.9  sows  the  impact  of  detection  noise  and  the  number  of  frames 
on  algorithm  performance,  section  6.10  shows  the  impact  of  PP  and  DP  data  on  image 
reconstruction,  and  section  6.11  describes  overall  algorithm  performance. 
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6.2  Algorithm  Implementation 

Given  the  large  space  over  which  optimization  must  be  done,  a  pre-packaged  bound- 
constrained  limited-memory  quasi-Newton’s  method,  L-BFGS-B  [11],  was  selected  for  op¬ 
timization  based  on  ease  of  use,  demonstrated  performance  for  large-scale  problems,  and 
the  ability  to  efficiently  constrain  the  solution  to  be  non-negative.  For  computational  effi¬ 
ciency,  circular  shifts  were  assumed  and  the  FFTW3  [19]  fast-Fourier-transform  library  was 
used  to  implement  the  required  convolutions  and  correlations.  The  objective  function  and 
other  components  of  the  algorithm  were  programmed  in  C  using  the  C99  standard  to  handle 
complex  numbers.  Code  used  to  implement  the  algorithm  is  included  in  Appendix  A. 

For  the  static  aberration  case,  much  of  the  computational  burden  of  the  reconstruction 
can  be  lifted  by  collapsing  the  data  set  into  a  single  set  of  averaged  PP,  FP,  and  DP  data. 
This  moves  many  of  the  computationally  expensive  operations  outside  the  solving  loop 
and  increases  speed,  making  the  problem  essentially  a  single-frame  reconstruction.  For 
implementation  purposes,  static-aberration  data  was  pre-averaged  before  being  passed  to 
the  multi-frame  reconstruction  algorithm  as  a  single- frame  data  set. 

Because  the  built-in  stopping  criteria  in  the  L-BFGS-B  algorithm  depends  on  machine 
precision,  and  because  reconstructions  were  done  on  several  different  machine  architectures, 
the  built-in  stopping  criteria  terminated  the  iterations  at  different  points  on  different  ma¬ 
chines.  To  establish  a  platform-independent  stopping  criteria,  the  L-BFGS-B  algorithm  was 
set  to  terminate  when 

I'old  tLnew  L  Itol  Ddd  (6.1) 

where  7 toi  was  selected  for  optimal  performance,  and  the  L-BFGS-B  algorithm  ensures 

I^old  h  Dnew- 

6.3  Data  Simulation 

Simulated  data  was  created  in  Mat  lab®  by  generating  independent  complex  circular 
Gaussian  variates  with  a  standard  deviation  equal  to  the  file-object  intensity  consistent  with 
(3.25).  The  resulting  complex  field  was  then  “propagated”  to  the  imaging  system  PP  using 
an  FFT  to  implement  far-held  diffraction.  The  PP  intensity  was  then  computed  by  taking 
the  magnitude  squared  of  the  held. 
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To  include  the  effects  of  atmospheric  turbulence,  temporally  uncorrelated  random 
phase  screens  were  generated  assuming  Kolmogorov  statistics  as  described  in  section  4.2 
using  200  Zernike  modes  and  applied  to  the  PP  field  for  the  FP  and  DP  data  channels.  An 
amount  of  defocus  phase  (Zernike  mode  4)  sufficient  to  introduce  0.5A  RMS  phase  diversity 
was  also  applied  the  PP  field  for  the  diversity-channel  data.  The  diversity  and  focal  channel 
fields  were  then  “propagated”  to  the  DP  and  FP  using  additional  FFTs.  The  intensities 
were  then  computed  by  taking  the  magnitude  squared  of  the  complex  field. 

To  account  for  detection  noise  processes,  both  Gaussian-distributed  read  noise  and 
Poisson  distributed  shot  noise  were  applied  to  the  data.  Gaussian  detector  noise  was  char¬ 
acterized  by  the  standard  deviation  a d-  The  average  detection  SNR  including  both  noise 
processes  is  then  defined  by 

SNR  =  P  (6.2) 

\JP  +  ad 

where  p  is  the  average  number  of  photo-electrons  (e_)  per  pixel  in  the  data  set.  To  generate 
data  with  a  desired  detection  SNR,  (6.2)  was  inverted  and  the  data  scaled  to  the  required 
average  p.  The  scaled  data  was  then  used  to  generate  Poisson-distributed  random  numbers 
with  mean  p.  Uncorrelated  Gaussian  random  numbers  with  a  standard  deviation  of  were 
then  added  to  each  pixel.  For  both  the  dynamic  and  static  aberration  cases,  the  <5-renroved 
PP-PSD  was  pre-computed  as  part  of  the  data  simulation  routine.  Before  being  passed  to 
the  solver,  and  any  negative  pixels  were  set  to  zero,  and  the  data  frames  were  scaled  such 
that  the  average  value  in  each  set  of  frames  (FP,  PP,  and  DP)  was  one.  Code  used  to 
generate  simulated  data  is  included  in  Appendix  B 

6.4  Performance  Metrics 

Because  of  the  large  number  of  data  realizations  inherent  in  a  Monte  Carlo  analysis, 
easily  computed  performance  metrics  are  key.  Although  there  exist  a  myriad  of  quality 
metrics  for  images,  they  are  often  subjective  and  require  human  interaction.  Rather  than 
delve  into  the  realm  of  choosing  an  “optimal”  quality  metric,  a  simple  mean-squared  error 
metric,  given  by 

mse  =  jp  -°(x>y))2  (6-3) 

x,y 
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where  o  is  the  truth  image,  and  6  is  the  resulting  image  estimate,  was  used  for  characterizing 
the  reconstructed  images.  While  it’s  easy  to  contrive  scenarios  where  this  metric  would  result 
in  a  poor  score  for  an  essentially  good  image,  most  (if  not  all)  simple  image  metrics  suffer 
from  similar  problems.  Furthermore,  in  the  few  scenarios  where  the  metric  might  show  a 
poor  reconstruction,  the  reconstructed  image  can  be  subjectively  evaluated. 

To  demonstrate  the  effectiveness  of  MSE  as  an  image  quality  metric,  reconstructions 
with  a  low  (1.83),  medium  (2.89),  and  high  (6.86)  MSE  are  shown  in  Fig.  6.1.  Fig.  6.1(a) 
shows  the  truth  image.  Fig.  6.1(b)  represents  a  good  reconstruction,  and  was  obtained 
with  50  frames  of  dynamically  aberrated  data,  ad  =  6e_,  SNR=7,  and  15  Zernike  modes. 
Fig.  6.1(c)  represents  an  average  reconstruction,  and  was  obtained  with  20  frames  of  aber¬ 
rated  data,  ad  =  6e_,  SNR=7,  and  15  Zernike  modes.  Fig.  6.1(d)  represents  a  poor  recon¬ 
struction,  and  was  obtained  with  10  frames  of  aberrated  data,  ad  =  6e_,  SNR=3,  and  15 
Zernike  modes.  The  image  clearly  becomes  worse  as  MSE  increases. 

The  other  key  performance  characteristics  considered  were  total  computation  time  and 
the  number  of  objective  function  evaluations  required  for  the  maximization  to  converge.  To 
the  extent  possible,  each  scenario  was  run  on  the  same  machine  architecture  to  facilitate 
comparison  of  computation  times.  However,  different  scenarios  were  run  on  several  differ¬ 
ent  machines,  making  comparison  of  computation  times  between  them  fruitless.  In  such 
scenarios,  comparison  of  function  evaluations  provides  a  better  metric. 

6.5  Aberration  Regularization 

Thelen’s  atmospheric  regularization  term  of  (4.79)  was  initially  implemented  as  part 
of  the  algorithm.  However,  introduction  of  the  regularization  term  decreased  neither  the 
computational  burden  nor  the  residual  error  in  the  reconstruction.  Because  it  had  a  net 
negative  effect  on  performance,  all  further  analysis  on  the  APDI  algorithm  was  done  with 
aberration  regularization  disabled. 

6.6  Optimal  Conditioning  Bias 

The  conditioning  bias  was  implemented  as  a  percentage  of  the  average  data  value.  To 
determine  optimal  conditioning  bias,  100  30-frame  data  sets  of  both  static-aberration  and 
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(a)  Truth 


(b)  MSE  =  1.83 


(c)  MSE  =  2.89 


(d)  MSE  =  6.76 


Figure  6.1:  (a)  Truth  image,  (b)  MSE=1.83,  (c)  MSE=2.89,  and  (d)  MSE=6.76.  (b) 

was  the  lowest  MSE  reconstruction,  (c)  an  average  MSE  reconstruction,  and  (d)  the  highest 
MSE  reconstruction  obtained  for  dynamic  aberrations  with  near-optimal  b  and  7 toi. 


dynamic-aberration  data  were  generated  using  SNR=7  and  =  6e~.  Each  data  set  was 
then  processed  with  7 toi  =  1  x  ICE9  for  static  aberrations,  and  7 toi  =  2  x  10~'  for  dynamic 
aberrations.  In  both  the  static-  and  dynamic-aberration  cases,  only  15  of  the  200  Zernike 
modes  were  included  in  the  reconstruction. 

Simulation  results  for  MSE  as  a  function  of  conditioning  bias  are  shown  in  Fig.  6.2. 
The  results  show  that  selecting  too  small  a  bias  results  in  worse  reconstructions.  While  a 
small  bias  produces  a  problem  that  is  closer  to  the  statistically  derived  expression,  the  ob¬ 
jective  becomes  poorly  conditioned,  and  the  results  are  worse.  Large  biases  also  worsen  the 
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reconstruction  by  moving  the  problem  away  from  the  statistics  for  which  it  was  developed. 
The  spike  in  MSE  for  b  =  10%  of  the  average  pixel  value  and  the  slight  rise  for  b  =  40%  seen 
for  dynamic  aberrations  in  Fig.  6.2(b)  are  unexplained.  However,  for  the  purposes  of  pro¬ 
ducing  low-MSE  images,  biases  of  b  =  10%  and  b  =  50%  appear  optimal  for  the  static  and 
dynamic  aberration  cases  respectively.  The  difference  between  optimal  biases  for  the  static 
and  dynamic  cases  likely  stems  from  the  noise  reduction  that  results  when  static-aberration 
data  frames  are  pre-averaged.  This  reduction  in  effective  noise  serves  to  better  condition 
the  problem  and  allow  for  a  smaller  bias  before  negative  impacts  are  seen. 


Residual  Error  vs.  Conditioning  Bias 


Bias 


MSE  vs.  Conditioning  Bias 


0  0.2  0.4  0.6  0.8  1 


Bias 


(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.2:  Residual  MSE  versus  conditioning  bias  b.  Residual  MSE  increases  for  small 
biases  as  the  problem  becomes  less  well-conditioned.  MSE  also  increases  for  larger  biases 
as  the  problem  moves  away  from  the  statistically  derived  objective. 


Computation  time  as  a  function  of  conditioning  bias  is  shown  in  Fig.  6.3.  For  both 
static  and  dynamic  aberrations  the  computation  time  drops  almost  exponentially  with 
increasing  bias.  As  the  bias  increases  and  the  problem  becomes  better  conditioned,  the 
solver  requires  fewer  and  fewer  iterations  to  converge  to  a  solution.  If  computation  time 
is  paramount,  a  large  decrease  in  time  can  be  obtained  for  a  relatively  small  increase  in 
residual  MSE  by  simply  increasing  the  conditioning  bias.  This  is  particularly  true  for  the 
dynamic  aberration  case. 
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Computation  Time  vs.  Conditioning  Bias 
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Figure  6.3:  Computation  time  versus  conditioning  bias  b.  Computation  time  decreases 

almost  exponentially  with  increasing  bias  as  the  problem  becomes  better  conditioned. 

6. 7  Optimal  Stopping  Tolerance 

With  an  established  conditioning  bias,  the  next  step  is  characterization  of  the  effect 
that  convergence  tolerance  has  on  performance.  The  effects  of  detection  SNR,  number  of 
frames,  size  of  the  data  array,  and  object  being  imaged  on  the  optimal  convergence  tolerance 
were  all  investigated. 

6.7.1  Detection  SNR.  The  effects  of  detection  SNR  on  the  optimal  stopping 
tolerance  were  quantified  by  generating  100  30-frame  data  sets  with  0^  =  6  and  SNR  equal 
to  2,  3,  5,  7,  10,  15,  20,  and  oo.  The  simulated  data  sets  were  then  post  processed  using 
15  Zernike  modes,  b  =  0.5  and  convergence  tolerances  ranging  from  =  1  x  10~10  to 
7 toi  =  1  x  10~4.  The  resulting  MSE  curves  are  shown  in  Fig.  6.4.  As  can  be  seen,  the 
only  significant  impact  of  detection  SNR  on  the  curves  is  a  shift  toward  higher  MSE  with 
decreasing  SNR  below  SNR=7. 

As  shown  in  Fig.  6.5  and  Fig.  6.6,  computation  time  and  function  evaluations  were 
only  moderately  impacted  by  variations  in  SNR. 

6.7.2  Number  of  frames.  To  investigate  the  effect  of  additional  data  frames  on 
optimal  stopping  tolerance,  100  data  realizations  with  10,  20,  30,  40,  and  50  frames  were 


Computation  Time  vs.  Conditioning  Bias 


(b)  Dynamic  Aberration 
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MSE  vs.  Convergence  Tolerance  MSE  vs.  Convergence  Tolerance 


Figure  6.4:  MSE  vs.  convergence  tolerance  and  SNR.  The  optimal  convergence  tolerance 
doesn’t  shift  significantly  with  changes  in  detection  SNR. 


Computation  Time  vs.  Convergence  Tolerance 


Computation  Time  vs.  Convergence  Tolerance 


(b)  Dynamic  Aberration 


Figure  6.5:  Computation  time  vs.  convergence  tolerance  and  SNR.  Computation  time 

shows  a  slight  dependence  on  detection  SNR. 
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Function  Evaluations  vs.  Convergence  Tolerance 


(a)  Static  Aberration 


Function  Evaluations  vs.  Convergence  Tolerance 


(b)  Dynamic  Aberration 


Figure  6.6:  Function  Evaluations  vs.  convergence  tolerance  and  SNR.  Function  evalua¬ 

tions  show  a  slight  dependence  on  detection  SNR. 


generated  with  SNR=7  and  a d  =  6e~  for  both  static  and  dynamic  aberrations.  These  data 
sets  were  then  post  processed  allowing  for  15  Zernike  modes  and  with  b  =  0.5.  Residual 
MSE  as  a  function  of  7 toi  is  shown  in  Fig.  6.7.  As  shown  in  Fig.  6.7(a),  there  is  no  penalty 
for  selecting  tolerance  values  smaller  than  the  optimal  value  of  7 toi  ~  6  x  10-8.  This  is  due  to 
the  fact  that  at  this  point,  the  solver  reaches  a  point  where  L0id  =  Lnew,  and  the  iterations 
stop  regardless  of  7 toi-  However,  for  the  dynamic  aberration  case  shown  in  Fig.  6.7(b), 
tolerances  smaller  than  7 toi  ~  2  x  10-7  result  in  increased  MSE. 

Another  notable  feature  of  Fig.  6.7(b)  is  that  the  relative  deviation  of  MSE  from  its 
optimal  value  caused  by  setting  7 toi  too  small  is  a  strong  function  of  the  number  of  data 
frames.  The  increase  in  MSE  as  7 toi  goes  lower  than  the  optimal  value  is  likely  because  the 
algorithm  begins  fitting  the  reconstruction  to  noise  in  the  data.  Visual  inspection  of  the 
results  support  this  hypothesis  by  showing  that  the  reconstructed  image  becomes  more  and 
more  speckled  as  the  solver  goes  beyond  the  optimal  result.  As  more  and  more  frames  are 
added  the  effect  of  noise  becomes  less  and  less  pronounced,  and  the  reconstruction  results 
in  a  smaller  deviation  from  optimum  MSE  values.  A  close  inspection  of  the  10  frame  line 
for  static  aberrations  in  Fig.  6.7(a)  shows  the  beginnings  of  the  same  rise  in  MSE  with 
decreasing  7 toi,  bearing  out  the  notion  that  the  rise  is  due  to  excess  noise  in  the  data. 
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Figure  6.7:  MSE  vs.  convergence  tolerance  for  10,  20,  30,  40,  and  50  frame  data  sets,  (a) 
The  number  of  frames  has  little  or  no  effect  of  optimal  'jtoi  for  static  aberrations.  There  is 
no  penalty  for  selecting  a  tolerance  smaller  than  the  optimum,  (b)  The  number  of  frames 
has  little  effect  on  the  optimal  tolerance  for  dynamic  aberrations  as  well,  with  only  a  slight 
shift  for  less  than  20  frames.  If  the  tolerance  is  set  too  low,  the  algorithm  begins  fitting  to 
noise  and  results  in  much  worse  reconstructions. 


The  number  of  function  evaluations  and  convergence  times  for  these  computations  are 
shown  in  Fig.  6.8  and  Fig.  6.9  respectively.  As  can  be  seen  in  Fig.  6.8(b)  and  Fig.  6.9(b), 
judicious  selection  of  convergence  tolerance  for  the  dynamic  aberration  case  not  only  signifi¬ 
cantly  reduces  the  MSE,  but  can  also  reduce  the  number  of  function  evaluations  and  associ¬ 
ated  computation  time  by  as  much  as  50%.  The  static  aberration  results  of  Fig.  6.8(a)  and 
Fig.  6.9(a)  indicate  that  as  much  as  a  25%  decrease  in  computational  cost  can  be  obtained 
for  no  more  than  a  10%  increase  in  MSE. 


6.7.3  Array  Size.  To  see  the  effect  of  detector  array  size  on  optimal  convergence 
tolerance,  a  128  x  128  pixel  object  was  resampled  to  64  x  64,  and  32  x  32  pixels.  Each  of 
the  objects  was  then  used  to  generate  100  30-frame  data  sets  for  both  static  and  dynamic 
aberrations  using  SNR=7  and  ad  =  6e^.  These  data  sets  were  then  post-processed  allowing 
for  15  Zernike  modes  and  setting  b  =  0.5.  The  resampled  objects  are  shown  in  Fig.  6.10,  and 
the  resulting  MSE  in  Fig.  6.11.  For  static  aberrations,  the  convergence  tolerance  appears 
to  be  essentially  insensitive  to  array  size,  but  for  dynamic  aberrations  the  optimal  tolerance 
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Function  Evaluations  vs.  Convergence  Tolerance  Function  Evaluations  vs.  Convergence  Tolerance 


Figure  6.8:  Function  evaluations  vs.  7 toi  and  frames  for  (a)  static  aberrations  and  (b)  dy¬ 
namic  aberrations.  The  number  of  frames  has  only  a  marginal  effect  on  optimal  convergence 
tolerance. 


Computation  Time  vs.  Convergence  Tolerance 


Computation  Time  vs.  Convergence  Tolerance 


(b)  Dynamic  Aberration 


Figure  6.9:  Computation  time  vs.  7 toi  and  frames  for  (a)  static  aberrations  and  (b) 

dynamic  aberrations.  The  number  of  frames  has  a  significant  impact  on  computation  time 
due  to  the  increase  in  computational  overhead  for  the  dynamic  aberration  case. 
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changes  by  as  much  as  two  orders  of  magnitude  between  the  128  x  128  and  32  x  32  pixel 
images,  generally  shifting  toward  larger  values  for  smaller  data  arrays. 


(a)  32  x  32  (b)  64  x  64  (c)  128  x  128 


Figure  6.10:  Truth  objects  used  for  evaluating  the  effect  of  object  size  on  optimal  con¬ 

vergence  tolerance. 

The  number  of  function  evaluations  and  computation  times  for  the  varying  object 
sizes  are  shown  in  Fig.  6.12  and  Fig.  6.13  respectively.  As  expected,  both  the  number  of 
function  evaluations  and  the  computation  time  are  strongly  dependent  on  the  object  size. 
As  the  array  size  increases,  the  number  of  unknowns  grow  and  the  optimization  routine 
takes  more  iterations  to  find  the  maximum.  In  addition  to  the  increased  number  of  function 
evaluations  required  for  convergence,  the  computational  cost  of  each  evaluation  scales  with 
the  image  size. 

6.7.4  Source  Object.  To  verify  that  the  optimal  convergence  tolerance  is  not 
impacted  by  the  source  object,  four  different  objects  (shown  in  Fig.  6.14)  were  each  used 
to  generate  100  30-frame  data  sets  with  SNR=7  and  07  =  6e_  for  both  static  and  dynamic 
aberrations.  These  data  sets  where  then  post-processed  using  15  Zernike  modes  and  b  =  0.5. 
Fig.  6.15  shows  that  while  the  absolute  MSE  varies,  the  location  of  the  optimum  convergence 
tolerance  is  essentially  independent  of  the  truth  object,  and  remains  within  a  factor  of  one 
to  two  for  all  the  tested  objects. 

Computation  time  and  function  evaluations  as  a  function  of  7 toi  for  the  test  objects 
are  shown  in  Fig.  6.16  and  Fig.  6.17.  The  number  of  function  evaluations,  and  consequently 
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MSE  vs.  Convergence  Tolerance  MSE  vs.  Convergence  Tolerance 


(a)  Static  Aberration  (b)  Dynamic  Aberration 

Figure  6.11:  MSE  vs.  convergence  tolerance  for  the  file  objects  of  Fig.  6.10  for  (a)  static 
aberrations,  and  (b)  dynamic  aberrations.  Optimal  convergence  tolerance  for  static  aber¬ 
rations  appears  essentially  insensitive  to  the  size  of  the  data  array,  while  the  dynamic 
aberration  case  shows  a  strong  dependence. 


Function  Evaluations  vs.  Convergence  Tolerance 


(a)  Static  Aberration 


Function  Evaluations  vs.  Convergence  Tolerance 


(b)  Dynamic  Aberration 


Figure  6.12:  Function  evaluations  vs.  7 toi  and  array  size  for  (a)  static  aberrations  and  (b) 
dynamic  aberrations.  As  expected,  the  increase  in  unknowns  associated  with  larger  arrays 
has  a  significant  impact  on  the  number  of  iterations. 
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Function  Evaluations  vs.  Convergence  Tolerance 


Function  Evaluations  vs.  Convergence  Tolerance 


(b)  Dynamic  Aberration 


Figure  6.13:  Computation  Time  vs.  7 toi  and  array  size  for  (a)  static  aberrations  and  (b) 

dynamic  aberrations.  As  expected,  the  increase  in  unknowns  associated  with  larger  arrays 
has  a  significant  impact  on  computation  time. 


the  computation  time,  show  a  significant  dependence  on  the  object  being  imaged.  However, 
the  optimal  tolerance  doesn’t  appear  to  be  similarly  impacted. 


6.8  Corrected  Modes 

To  understand  the  trade-offs  inherent  in  compensating  additional  aberration  modes, 
100  sets  of  30-frame  data  were  generated  using  SNR=7  and  0^  =  6.  These  data  sets 
were  then  post-processed  with  b  =  0.5,  and  7 t„i  =  1  x  10~9  for  static  aberrations  and 
7 toi  =  2  x  10-7  for  dynamic  aberrations.  Each  data  set  was  processed  allowing  for  5, 
10,  15,  20,  30,  50,  and  100  compensated  Zernike  modes.  The  resulting  MSE  is  shown  in 
Fig.  6.18.  In  both  cases,  MSE  drops  rapidly  as  more  and  more  modes  are  compensated  until 
approximately  30  modes  are  included.  After  this  point,  MSE  rises  again.  The  reduction  in 
MSE  is  consistent  with  the  results  expected  as  the  model  of  the  blurring  aberration  becomes 
more  complete.  However,  by  the  time  50  modes  are  included,  the  mode  variance  for  high- 
order  modes  is  approximately  four  orders  of  magnitude  below  that  of  the  low-order  modes, 
and  the  algorithm  is  likely  fitting  to  noise  in  the  data. 

As  shown  in  Fig.  6.19  and  Fig.  6.20,  the  number  of  corrected  modes  has  a  dramatic 
effect  on  the  computation  time,  and  to  a  lesser  extent  on  the  number  of  function  evaluations 
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(a)  OBJ  1 


(b)  OBJ  2 


(c)  OBJ  3  (d)  OBJ  4 

Figure  6.14:  Truth  objects  used  for  evaluating  the  effect  of  the  object  on  optimal  conver¬ 
gence  tolerance. 
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Figure  6.15:  MSE  vs.  convergence  tolerance  for  the  file  objects  of  Fig.  6.14  for  (a)  static 
aberrations,  and  (b)  dynamic  aberrations.  Optimal  convergence  tolerance  appears  essen¬ 
tially  insensitive  to  the  object  being  imaged. 


Function  Evaluations  vs.  Convergence  Tolerance  Function  Evaluations  vs.  Convergence  Tolerance 


(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.16:  Function  evaluations  vs.  7 toi  and  truth  object  for  (a)  static  aberrations  and 

(b)  dynamic  aberrations.  The  truth  object  has  a  moderately  strong  impact  on  the  number 
of  evaluations  required  for  convergence. 
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Computation  Time  vs.  Convergence  Tolerance  Computation  Time  vs.  Convergence  Tolerance 


(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.17:  Computation  time  vs.  7 toi  and  truth  object  for  (a)  static  aberrations  and 

(b)  dynamic  aberrations.  The  truth  object  has  a  moderately  strong  impact  on  computation 
time. 


Residual  Error  vs.  Modes 
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(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.18:  MSE  initially  drops  with  increasing  modes,  then  slowly  climes  beyond  30 

modes  for  (a)  static  aberrations,  and  (b)  dynamic  aberrations. 
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required  for  convergence.  Because  a  great  deal  of  the  computational  time  is  spent  comput¬ 
ing  the  gradients  with  respect  to  the  aberration  parameters,  the  increase  in  computation 
time  is  expected.  The  increase  in  function  evaluations  can  be  attributed  to  the  increased 
dimensionality  of  the  parameter  space.  The  dip  in  function  evaluations  for  100  modes  with 
static  aberrations  is  likely  due  to  the  optimizer  terminating  iterations  prematurely. 


Function  Evaluations  vs.  Modes 


Function  Evaluations  vs.  Modes 


(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.19:  Function  evaluations  vs.  modes  for  (a)  static  aberrations  and  (b)  dynamic 

aberrations.  Increasing  the  number  of  included  modes  initially  improves  MSE  at  the  cost 
of  a  moderate  increase  in  function  evaluations. 


6.9  Number  of  Frames  and  Dominant  Detection  Noise 

To  characterize  the  effects  of  the  number  of  data  frames  and  type  and  severity  of 
detection  noise  sources,  100  static  and  dynamic  aberration  data  sets  with  10,  20,  30,  40, 
and  50  frames  were  generated  with  SNR  values  of  2,  3,  5,  7,  10,  15,  and  20  using  both 
cjrf  =  6e_  and  =  100e~.  The  static  aberration  data  sets  were  post  processed  using  15 
Zernike  modes,  b  =  0.1  and  jtol  =  1  x  10~8.  Dynamic  aberration  data  sets  were  post- 
processed  using  15  Zernike  modes,  b  =  0.5,  and  7 toi  =  2  x  10~'.  Results  for  the  static 
aberration  case  are  shown  in  Fig.  6.21  and  Fig.  6.23.  Dynamic  aberration  results  are  shown 
in  Fig.  6.22  and  Fig.  6.24. 

For  both  the  static  and  dynamic  aberration  data  sets,  there  was  little  difference  be¬ 
tween  the  Poisson  dominated  ( ad  =  6e_)  and  Gaussian  dominated  (ad  =  100e“)  detection 


85 


(a)  Static  Aberration  (b)  Dynamic  Aberration 

Figure  6.20:  Computation  time  vs.  modes  for  (a)  static  aberrations  and  (b)  dynamic 

aberrations.  Increasing  the  number  of  included  modes  initially  improves  MSE  at  the  cost  of 
a  significant  increase  in  computation  time  caused  by  the  computational  burden  of  including 
the  extra  modes  in  the  system  model. 

noise.  The  slight  improvement  seen  for  the  static-aberration  10-frame  Poisson-dominated 
case  is  likely  due  to  the  fact  that  for  low  signal  levels,  Poisson  statistics  are  closer  than 
Gaussian  statistics  to  the  exponential  distribution  for  which  the  algorithm  was  derived. 
While  working  with  static  aberrations,  there  is  little  benefit  gained  by  adding  more  than 
20  frames  to  the  dataset.  Adding  additional  frames  to  the  dynamic  aberration  data  sets 
has  a  more  significant  effect.  Though  the  magnitude  of  the  MSE  reduction  tapers  off,  there 
continues  to  be  a  significant  reduction  through  50  frames. 

Another  notable  feature  is  the  overall  lower  MSE  and  tighter  error  bars  obtained  with 
dynamic  aberration  data  compared  with  static  aberration  data.  This  can  be  understood 
by  considering  incoherent  phase  diversity  imaging  [39]  and  multi-frame  blind  deconvolution 
[42].  With  moderate  to  strongly  aberrations,  multi-frame  blind  deconvolution  tends  to 
produce  better  results  than  phase  diversity.  With  static  aberrations,  the  APDI  algorithm  is 
essentially  an  incoherent  phase  diversity  algorithm  with  additional  data  included.  When  the 
aberration  changes  in  time,  the  system  begins  to  look  more  like  a  multi-frame  deconvolution 
with  the  associated  increase  in  performance. 
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MSE  v.s.  Detection  SNR,o  =6  MSE  v.s.  Detection  SNR,  o  ,=100 

d  d 


(a)  ad  =  6e  (b)  ad  =  100e 

Figure  6.21:  Static  aberration  MSE  vs.  SNR  for  10,  20,  30,  40,  and  50  frame  data  sets,  (a) 
represents  Poisson  and  (b)  Gaussian  dominated  detection  noise.  For  SNR  >  7  there  is  little 
improvement  in  MSE,  and  little  is  gained  by  averaging  more  than  20  frames  of  data.  Low 
SNR  performance  appears  better  for  Poisson  than  Gaussian  dominated  noise,  but  results  in 
similar  results  for  higher  SNR  values. 


LU 

CO 


MSE  v.s.  Detection  SNR,  a  =6e 
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MSE  v.s.  Detection  SNR,  a  =100e' 
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SNR  SNR 


(a)  ad  =  6e  (b)  ad  =  lOOe 

Figure  6.22:  Dynamic  Aberration  MSE  vs.  SNR  for  10,  20,  30,  40,  and  50  frame  data  sets, 
(a)  represents  Poisson  and  (b)  Gaussian  dominated  detection  noise.  For  SNR  >  7  there  is 
little,  if  any,  improvement  in  MSE.  Results  for  the  Poisson  and  Gaussian  dominated  cases 
are  statistically  indistinguishable.  Adding  extra  frames  to  the  dataset  initially  decrease 
MSE  rapidly,  but  the  reduction  tapers  off  for  more  than  approximately  30  frames. 
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Time  v.s.  SNR, a  =6  Time  v.s.  SNR, a  =1 00 

a  a 


(a)  od  =  6e  (b)  <jd  =  100e 


Figure  6.23:  Static  aberration  computation  time  vs.  SNR.  Aside  from  the  10  frame  case, 

additional  frames  have  little  or  no  impact  on  computation  time. 
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Figure  6.24:  Dynamic  aberration  computation  time  vs.  SNR.  SNR  has  only  a  slight  impact 
on  computation  time.  The  overall  scale  difference  between  the  a ^  =  6e_  and  =  100e_ 
cases  is  because  the  two  scenarios  were  run  on  different  computer  architectures. 
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6.10  Impact  of  PP  and  DP  Data  Planes  on  Reconstruction 

One  of  the  most  radical  changes  from  conventional  phase-diversity  imaging  was  the 
introduction  of  PP  data.  Because  introduction  of  an  additional  data  channel  implies  ad¬ 
ditional  hardware  and  the  lower  light  levels  associated  with  splitting  off  another  channel, 
it’s  important  to  quantify  the  improvement  offered  by  adding  the  new  channel.  To  test 
the  effects  of  the  PP  data,  100  data  realizations  were  generated  with  30  frames,  SNR=7, 
and  cr d  =  6e_.  The  resulting  datasets  were  then  processed  with  15  compensated  Zernike 
modes,  b  =  0.5,  and  'jtoi  =  1  x  IIP9  and  7 toi  =  2  x  10-7  for  static  and  dynamic  aberrations 
respectively  using  the  full  APDI  algorithm,  the  APDI  algorithm  including  just  conventional 
FP  and  DP  data,  and  with  the  APDI  algorithm  using  just  FP  and  PP  data.  The  resulting 
MSE  as  a  function  of  data  planes  is  shown  in  Fig.  6.25. 
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Figure  6.25:  Residual  MSE  for  the  full  algorithm,  without  the  DP  data,  and  without  the 
PP  data  for  (a)  static  aberrations,  and  (b)  dynamic  aberrations. 


Because  of  the  large  amount  of  information  in  the  PP  intensity  that  hasn’t  been 
corrupted  by  the  unknown  atmosphere,  the  PP  data  has  a  much  more  significant  impact 
on  system  performance  than  the  conventional  DP  data.  For  both  the  static  and  dynamic 
aberration  cases,  the  residual  MSE  was  only  slightly  higher  for  the  FP-PP  configuration 
than  it  was  for  the  complete  FP-DP-PP  configuration.  Reconstructions  done  without  PP 
data  included  resulted  in  148%  and  54%  increases  in  average  MSE  compared  with  the  Full 
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APDI  algorithm  for  the  dynamic  and  static  aberration  cases  respectively.  This  shows  that 
the  PP  data  has  a  dramatic  impact  on  overall  performance  of  the  reconstruction. 

Exclusion  of  the  DP  data  from  the  APDI  algorithm  resulted  in  only  a  11%  and  24% 
increase  in  MSE  for  the  static  and  dynamic  aberration  cases  respectively.  In  cases  where 
low-light  levels  force  a  poor  SNR,  the  marginal  gain  in  performance  obtained  by  using 
all  three  data  planes  may  be  swamped  by  the  decrease  in  performance  caused  by  a  low 
detection  SNR.  This  indicates  that  a  two-channel  system  comprised  of  a  FP  and  PP  sensor 
might  be  optimal  for  these  kinds  of  scenarios.  Even  when  SNR  isn’t  a  factor,  the  additional 
computational  burden  and  hardware  requirements  for  a  complete  three-channel  system  may 
not  be  justified  by  the  marginal  gain  in  system  performance. 

As  shown  in  Fig.  6.26  and  Fig.  6.27,  in  addition  to  the  gain  in  low-light  performance, 
elimination  of  the  DP  data  for  the  dynamic  aberration  case  also  serves  to  reduce  the  com¬ 
putation  time  required  for  convergence.  Because  the  PP  data  is  insensitive  to  the  changing 
aberration,  it  can  be  pre-averaged,  and  much  of  the  computational  burden  lifted.  However, 
the  DP  data  depends  on  the  atmosphere,  and  a  reasonably  large  amount  of  computation 
must  be  performed  at  every  iteration  to  include  this  channel.  Because  of  this  disparity, 
the  FP-PP  configuration  converges  faster  than  the  other  configurations  in  spite  of  requiring 
more  function  evaluations.  For  the  static-aberration  case,  the  data  frames  are  pre-averaged 
in  all  channels,  and  the  computational  burden  for  the  DP  data  becomes  less  than  that  of 
the  PP  data. 

6.11  Overall  Performance 

Of  interest  is  the  reduction  in  MSE  relative  to  the  raw  frame-averaged  FP  intensity. 
Ideally,  the  reconstruction  should  be  universally  better  than  the  raw  data,  and  result  in 
a  lower  MSE  for  the  reconstruction  than  for  the  averaged  raw  data.  The  pre-  and  post- 
processed  MSE  for  all  of  the  14,000+  realizations  used  to  generate  Fig.  6.21  and  Fig.  6.22 
are  shown  in  the  scatter  plot  of  Fig.  6.28.  The  solid  line  marks  the  point  where  the  pre- 
and  post-processed  MSE  are  equal.  For  the  static  aberration  case,  approximately  3%  of 
the  reconstructions  result  in  an  MSE  greater  than  the  pre-processed  MSE.  As  the  averaged 
focal  plane  data  becomes  worse  and  worse,  the  frequency  of  poor  reconstructions  increases. 
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Function  Evaluations  vs.  Regularization  Planes 


Configuration 


(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.26:  Function  evaluations  vs.  data  planes  for  (a)  static  aberrations  and  (b)  dy¬ 

namic  aberrations.  For  the  dynamic  aberration  case,  elimination  of  the  DP  data  reduces 
the  computational  burden.  The  same  effect  is  not  seen  for  static  aberration  data. 


Configuration 


Computation  Time  vs.  Regularization  Planes 


(a)  Static  Aberration 


(b)  Dynamic  Aberration 


Figure  6.27:  Computation  time  vs.  data  planes  for  (a)  static  aberrations  and  (b)  dynamic 
aberrations.  For  the  dynamic  aberration  case,  elimination  of  the  DP  data  reduces  the 
computational  burden.  The  same  effect  is  not  seen  for  static  aberration  data. 
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However,  in  many  of  the  cases  where  post-processed  MSE  is  higher  than  the  averaged  FP 
MSE,  visual  inspection  of  the  reconstruction  shows  that  these  points  are  a  result  of  the 
error-metric  being  inadequate.  In  a  few  cases,  the  question  of  whether  the  or  not  the 
reconstruction  is  worse  than  the  raw  data  becomes  subjective,  and  it  might  be  argued  that 
the  reconstruction  diverged.  For  the  dynamic  aberration  case,  all  of  the  reconstructions 
result  in  an  MSE  that  is  lower  than  the  frame-averaged  FP  MSE  in  spite  of  the  fact  that 
the  averaged  FP  data  typically  had  a  higher  MSE  than  was  the  case  for  static  aberrations. 


Post-Processed  MSE  vs.  Average  FP  Data  MSE  Post-Processed  MSE  vs.  Average  FP  Data  MSE 


Average  FP  Data  MSE  (au)  Average  FP  Data  MSE  (au) 


(a)  Static  Aberration  (b)  Dynamic  Aberration 

Figure  6.28:  Pre-  and  post-processed  MSE  for  7,000+  (a)  static  aberrations,  and  (b) 

dynamic  aberrations.  The  solid  line  marks  the  point  where  pre-  and  post-processed  MSE’s 
are  equal.  Approximately  3%  of  the  reconstructions  with  static  aberrations  had  a  higher 
MSE  than  the  raw  data.  The  prevalence  of  high-MSE  reconstructions  increases  as  the 
averaged  FP  MSE  increases.  None  of  the  dynamic  aberration  reconstructions  resulted  in  an 
MSE  that  was  higher  than  the  raw  data. 


The  lowest,  average,  and  highest  MSE  reconstructions  for  static  aberration  data  are 
shown  in  Fig.  6.29,  Fig.  6.31,  and  Fig.  6.33  respectively.  The  lowest,  average,  and  high¬ 
est  MSE  reconstructions  for  dynamic  aberrations  are  shown  in  Fig.  6.30,  Fig.  6.32,  and 
Fig.  6.34.  The  best  reconstruction  from  static  aberration  data  has  the  lowest  MSE  of  any 
reconstruction  done  as  a  part  of  this  work,  represents  a  fairly  benign  aberration,  and  could 
be  considered  a  “glory  shot”  for  the  system. 
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(a)  Truth  (b)  FP  Data  (MSE=4.65)  (c)  Result  (MSE=1.67) 

Figure  6.29:  Lowest  MSE  reconstruction  from  static  aberration  data.  Generated  with 

SNR=20,  30  frames,  and  =  6e~.  (a)  Truth  object,  (b)  Frame-averaged  FP  data  with 
MSE=4.65.  (c)  15-mode  reconstruction  with  MSE=1.67 


(a)  Truth  (b)  FP  Data  (MSE=6.59)  (c)  Result  (MSE=1.83) 

Figure  6.30:  Lowest  MSE  reconstruction  from  dynamic  aberration  data.  Generated  with 
SNR=7,  50  frames,  and  ad  =  6e_.  (a)  Truth  object,  (b)  Frame-averaged  FP  data  with 
MSE=6.59.  (c)  15-mode  reconstruction  with  MSE=1.83 
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(a)  Truth  (b)  FP  Data  (MSE=5.54)  (c)  Result  (MSE=3.77) 

Figure  6.31:  Average  MSE  reconstruction  from  static  aberration  data.  Generated  with 

SNR=3,  20  frames,  and  cr^  =  100e-.  (a)  Truth  object,  (b)  Frame-averaged  FP  data  with 
MSE=5.54.  (c)  15-mode  reconstruction  with  MSE=3.77 
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(a)  Truth  (b)  FP  Data  (MSE=6.69)  (c)  Result  (MSE=2.79) 

Figure  6.32:  Average  MSE  reconstruction  from  dynamic  aberration  data.  Generated  with 
SNR=2,  50  frames,  and  a d  =  100e_.  (a)  Truth  object,  (b)  Frame-averaged  FP  data  with 
MSE=6.69.  (c)  15-mode  reconstruction  with  MSE=2.79 
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(a)  Truth  (b)  FP  Data  (MSE=7.29)  (c)  Result  (MSE=10.18) 

Figure  6.33:  Worst  MSE  reconstruction  from  static  aberration  data.  Generated  with 

SNR=2,  10  frames,  and  a d  =  100e-.  (a)  Truth  object,  (b)  Frame-averaged  FP  data  with 
MSE=7.29.  (c)  15-mode  reconstruction  with  MSE=10.18 


(a)  Truth  (b)  FP  Data  (MSE=6.90)  (c)  Result  (MSE=6.76) 

Figure  6.34:  Worst  MSE  reconstruction  from  dynamic  aberration  data.  Generated  with 

SNR=3,  10  frames,  and  <7^  =  6e_.  (a)  Truth  object,  (b)  Frame-averaged  FP  data  with 
MSE=6.90.  (c)  15-mode  reconstruction  with  MSE=6.76 
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VII.  Polarization  Data  in  Active  Imaging 

In  addition  to  conventional  intensity  measurements  used  for  the  APDI  algorithm,  the 
spatially  resolved  polarization  state  of  the  incident  light  can  be  measured,  and  might 
potentially  provide  valuable  information  for  use  in  an  image  reconstruction  algorithm.  This 
chapter  covers  the  development  of  three  approaches  to  using  PP  polarization  data  in  image 
reconstruction.  First,  a  statistical  description  for  PP  polarization  data  was  developed  and 
an  EM-algorithm  developed  to  reconstruct  the  image  by  estimating  the  field  correlation 
matrix  and  relating  it  to  the  original  object  intensity.  While  this  is  the  most  direct  and 
mathematically  rigorous  approach  to  using  PP  polarization  data,  the  derived  algorithm 
proved  to  be  far  too  complex  for  implementation. 

As  an  alternative,  PP  polarization  data  was  mathematically  shaped  to  fit  into  the  basic 
structure  of  the  APDI  algorithm,  and  minor  modifications  were  made  to  generate  a  lensless 
APDI  (LAPDI)  variant.  Given  a  good  starting  guess,  the  LAPDI  algorithm  converged  to  a 
good  reconstruction  in  a  few  iterations.  When  a  random  starting  guess  was  used  however, 
the  algorithm  stalled  at  a  local  maximum  after  one  iteration,  resulting  in  a  reconstruction 
that  was  useless.  In  addition  to  converging  to  false  reconstructions,  the  LAPDI  algorithm 
is  extremely  computationally  expensive  and  would  require  significant  changes  in  order  to  be 
practical. 

Finally,  a  statistical  description  of  the  polarization  phase  was  developed  with  the 
intent  of  replacing  the  PP  imaging  sensor  with  a  four-channel  imaging  polarimeter.  The 
relationship  between  the  polarization  phase  PSD  and  the  truth  object  was  established, 
and  an  approximate  numerical  form  developed.  Though  there  is  a  significant  amount  of 
information  about  the  truth  object  encoded  in  the  PP  polarization  phase,  the  developed 
implementation  resulted  in  worse  reconstructions  and  had  a  higher  computational  burden. 

7.1  Direct  Image  Estimation  From  Pupil  Plane  Polarization  State  Measurements 

The  first  question  considered  is  the  feasibility  of  reconstructing  an  image  directly  from 
the  spatially  resolved  PP  polarization  state.  For  the  purposes  of  model  development,  rough 
surface  scattering  is  assumed  to  be  the  dominant  factor  in  depolarization,  and  statistical 
independence  between  polarizations  is  assumed.  Similar  to  the  statistical  description  of 
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laser  speckle,  it’s  assumed  that  the  size  of  a  scattering  surface  on  the  object  is  sufficiently 
small  that  contributions  from  a  large  number  of  scatterers  are  observed  at  all  points  located 
any  significant  distance  from  the  object.  After  reflection  from  the  object,  a  linear,  homoge¬ 
neous,  isotropic  propagation  media  is  assumed,  and  scalar  diffraction  theory  is  used  to  treat 
polarization  components  independently. 

Treating  polarizations  independently,  the  polarized  portion  of  the  field  in  the  PP  is 
completely  described  by  projecting  it  against  the  primary  s  and  p  polarization  directions 

Us  =Ase>0'  (7.1a) 

Up  =Apej{6s+9l2)  (7.1b) 


where  0S  is  the  unknown  propagation  phase  for  the  s  polarization,  As  and  Ap  are  the  PP 
amplitudes  for  the  two  polarizations,  and  0\2  is  the  known  polarization  phase.  Assuming 
the  spatially  resolved  Stokes  vector  is  measured,  the  available  data  are  given  by 


As  —\J  ^  (So  +  ^i) 
^=V5(S»_Sl) 

012  =  arctan  — - 
^3 


(7.2a) 


(7.2b) 

(7.2c) 


where  as  before 


S'0  ^Sf  +  SI  +  SI 

=vs„ 


(7.3a) 

(7.3b) 


and  each  of  As,  Ap.  and  0i2  are  N  x  N  data  arrays. 

To  use  statistical  methods  to  estimate  the  lost  phase  information,  an  adequate  sta¬ 
tistical  model  for  the  data  must  be  developed.  For  the  linear- algebra  approach  used  here, 
two-dimensional  data  arrays  must  first  be  re-ordered  or  “lexicographically  stacked”  [1]  into 
an  N2  x  1  column  vector  which  is  built  up  by  “raster  scanning”  the  image  and  dumping  the 
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elements  sequentially  into  the  output  vector  such  that 
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(7.4) 


Once  the  data  has  been  restructured,  a  statistical  model  can  be  formed  using  familiar  linear 
algebra  notation. 

From  Chapter  III,  the  statistics  in  both  the  object  and  detector  planes  obey  circular 
complex  Gaussian  statistics,  described  by 


P(U)  =  - ^ — ^exp  {--UHT~lu\  (7.5) 

where  U  is  the  field  component  along  either  polarization  direction  and  T  is  the  associated 
correlation  matrix  where  the  ( i,j)th  element  is  E  [U*  (i)U  (j)],  which  can  also  be  interpreted 
as  the  mutual  intensity  Ji2(Pi,  Pj )  of  the  field  components  at  the  points  represented  by  the  i 
and  j  indexes  in  the  PP.  To  specify  the  joint  probability  of  the  two  polarization  components, 
independence  is  assumed,  and  the  probability  density  functions  for  each  component  are 
multiplied.  This  results  in  a  joint  probability  distribution  function  given  by 

P(U"  Ur)  =  ^vWiFT^H  +  U”Tr'Uv)}  (™) 


Given  that  As,  Ap,  and  9 12  are  known  to  within  the  uncertainty  of  noise  in  the  data, 
it  makes  sense  to  separate  these  quantities  out  of  (7.6)  and  isolate  the  common  phase  6S. 
To  do  this,  individual  field  elements  Us{i)  =  As(i)ePa W  and  Up(i)  =  Ap(i)e^ds^ e^12^  can 
be  written  Us(i)  =  As(i)t)(i)  and  Up(i)  =  where  i9(i)  =  eJ'0sW.  In  the  vector 


98 


representation  this  can  be  written 


U8  =AS$  (7.7a) 

Up  =A  pd  (7.7b) 

where  9  is  an  N 2  x  1  vector  defined  by 

■d  =  ej9s  (7.8) 

and 

As  =  diag  {AJ  (7.9a) 

Ap  =  diag  {Ap}  diag  je^12 1  (7.9b) 

where  diag{-}  is  a  diagonal  matrix  composed  of  elements  of  the  argument.  The  exponent 
of  (7.6)  can  then  be  written 


UfT-'Us  +  U^T-]UP  = 


(7.10) 


where 

M  =  Af  r7xAs  +  A^T-1  Ap  (7.11) 

Using  these  simplifications,  the  probability  distribution  of  •&,  As,  Ap,  and  9 12  given  Ts,  and 
rp  is  given  by 


P(9Sl  As,  Ap,  9i2\Ts,Tp) 


1 


exp 


(7.12) 


7.1.1  Phase  Estimation  With  a  Known  Correlation  Matrix.  After  completing  the 
statistical  system  model,  the  next  step  is  to  construct  an  estimation  rule  that  will  result 
in  the  best  possible  reconstruction  of  the  object  being  imaged.  One  approach  is  to  form 
an  estimate  of  the  phase  and  reconstruct  the  image  through  inverse-transforming  the  now 
complete  PP  field  distribution.  Alternatively,  the  correlation  matrices  and  Tp  can  be 
estimated  and  used  to  get  an  estimate  of  the  object  intensity  distribution  by  noting  that 
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the  (i,j)th  element  of  T  is  the  mutual  intensity  for  the  field  at  the  points  represented  by 
the  i  and  j  indexes.  Once  an  estimate  for  the  mutual  intensity  has  been  formed,  the  object 
can  be  reconstructed  using  (3.31)  via  an  inverse  Fourier  transform. 

Assuming  the  full  conditional  probability  is  available,  the  best  possible  estimate  for 
the  lost  phase  information  can  be  found  by  forming  the  ML  estimate  in  accordance  with 
section  2.1.3.  The  ML  estimation  rule  for  the  complex  phasor  d  is  given  by 

=  argmaxP(t?|As,  Ap,  6*12)  (7.13) 

where  d  is  the  ML  estimate  of  d.  To  eliminate  exponentials,  the  log-likelihood  is  maximized 
such  that 

d  =  argrnin  [d^Md]  (7-14) 

i9 

A  quick  look  at  the  form  of  (7.14)  shows  it  to  be  an  Ar2-dimensional  quadratic  that  is 
centered  on  zero,  implying  that  the  corresponding  minimum  would  be  located  at  the  point 
d  =  0.  However,  the  definition  of  d  from  (7.8)  places  constraints  on  the  possible  solutions, 
and  results  in  non-trivial  estimates. 

In  a  previous  implementation  of  an  estimator  for  d  [20],  a  constraint  was  imposed  that 
required  ||d||2  =  N  or  otherwise  expressed 

d^d  =  fV2  (7.15) 

Applying  this  constraint, 

d  =  argrnin  [daMd]  (7-16) 

tfHtf=N2 

Using  the  constrained  optimization  method  outlined  in  section  2.2.4,  the  optimum  estimate 
for  d  is  found  by  solving  the  eigenvalue  problem 

Md  =  Azd  (7.17) 

and  setting  the  estimate  d  equal  to  the  eigenvector  associated  with  the  minimum  eigenvalue. 
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While  the  constraint  given  by  (7.15)  gives  the  problem  a  convenient  form  and  limits 
the  set  of  possible  solutions  to  a  set  including  the  possible  valid  values  of  i?,  it  also  includes 
a  much  larger  set  of  possible  solution  vectors  that  do  not  correspond  to  valid  estimates. 
Given  that  i?  is  a  unit-magnitude  complex  phasor,  the  correct  constraint  function  for  the 
optimization  can  be  written 

90)  =  l  -V*$i  (7.18) 

where  is  the  ith  element  of  the  vector  valued  function  <7(1?),  i?,  is  the  ith  element  of 

1?,  and  *  denotes  complex  conjugation.  Using  this  constraint  function,  the  Lagrangian  for 
optimization  is  given  by 

L(V,\l)  =  #HMV  +  \lg(H)  (7.19) 

which  is  nonlinear  and  requires  special  care.  One  approach  to  solving  the  correctly  con¬ 
strained  problem  was  presented  by  Szeto  [47],  and  is  paraphrased  here. 

Given  that  piston  (a  constant  phase  across  the  reconstruction)  is  an  unobservable 
quantity  that  does  not  directly  impact  the  image  reconstruction,  it’s  constrained  to  be  zero 
to  eliminate  any  ambiguity  in  the  reconstruction  result.  To  implement  this,  the  solution  is 
constrained  such  that 

n 

Y  Im{di}  =  0  (7.20) 

2=1 

To  add  this  constraint  to  the  Lagrangian,  an  additional  multiplier  /q  is  added,  resulting  in 

L($,  A u  m)  =  +  A Jg{'&)  +  ml T Im{&}  (7.21) 

where  1  is  an  lV2-vector  of  ones  and  m  is  a  scalar  multiplier. 

To  eliminate  the  need  for  complex  algebra,  d  is  written  in  terms  of  its  real  and  imag¬ 
inary  parts  such  that 

■&  =  u  +  jv  (7.22) 

and  M  is  split  into  its  real  and  imaginary  parts  such  that 


A  =%{M} 
B  =Im{  M} 


101 


(7.23a) 

(7.23b) 


where  A  is  symmetric  and  B  is  skew-symmetric.  Substituting  these  definitions  into  (7.21) 
and  simplifying  results  in  the  modified  Lagrangian 

L(u,  v,  A i,  Hi)  =  u1  ( Au  —  Bv)  +  vT(Av  +  Bu)  +  Xf  g( u,  v)  +  /qlTv  (7.24) 

At  this  point  it’s  convenient  to  define 

A  =  diag  |  A;  |  (7.25) 

as  a  diagonal  matrix  built  from  the  vector  of  Lagrange  multipliers,  and  write  g{ u,  v)  in  a 
form  more  convenient  for  linear  algebra  operations 

<7(11,  v)  =  1  —  (Uu  +  Vv)  (7.26) 

where  U  and  V  are  diagonal  matrices  such  that 

U=diag{u}  (7.27a) 

V  =  diag  {v}  (7.27b) 

Differentiating  (7.24)  with  respect  to  u,  v,  A 1,  and  hi  and  setting  the  results  equal  to 
zero  produces  the  system  of  equations 


Au  —  Bv  =  —  Au 

(7.28a) 

Av  +  Bu  =  —  Av  —  Hi  1 

(7.28b) 

Uu  +  Vv  =1 

(7.28c) 

O 

II 

> 

F=l 

(7.28d) 

the  solution  of  which  will  yield  the  optimum  estimate  for  1?  which  can  then  be  used  in 
conjunction  with  the  remaining  data  to  form  an  image.  Szeto  showed  that  for  his  application, 
this  system  of  equations  converged  quadratically  using  Newton’s  method. 


102 


7.1.2  Correlation  Matrix  Estimation.  One  critical  assumption  in  the  previous 
approach  is  that  the  correlation  matrix  T  is  known.  Alternatively,  assuming  that  the  prop¬ 
agation  phase  is  known,  the  correlation  matrix  can  be  estimated  using  the  complex  field. 
Considering  (3.31)  and  ignoring  scaling  and  phase  terms,  T  is  only  a  function  of  the  separa¬ 
tion  (A u,  Av)  between  the  two  points  being  correlated.  Assuming  that  both  the  amplitude 
and  phase  information  are  available,  the  correlation  matrix  can  be  estimated  using  the  fact 
that  there  are  multiple  data  pairs  within  a  data  set  with  the  same  separation.  An  effi¬ 
cient  method  for  forming  this  estimate  is  to  use  FFT’s  to  generate  the  autocorrelation  of 
the  N  x  N  complex  field  arrays  assuming  circular  shifts,  then  using  the  autocorrelation  to 
populate  the  T  matrix  based  on  point  separations. 

Unfortunately,  the  phase  information  is  not  available.  What  is  available,  however,  is 
the  polarization  phase.  To  consider  the  relationship  between  the  propagation  phase  and  the 
polarization  phase,  the  phases  at  two  points  Pi  and  P2  are  written 


0]  =S-polarization  phase  @  Pi 

(7.29a) 

Op  =P-polarization  phase  @  Pi 

(7.29b) 

02  =S-polarization  phase  @  P2 

(7.29c) 

Op  =P-polarization  phase  @  P2 

(7.29d) 

The  polarization  phases  are 

then  given  by 

012  =°l  ~ 

(7.30a) 

r\2  r\2  r\2 

“l2  —^s  Vp 

(7.30b) 

Evaluating  the  polarization  phase  correlation  in  terms  of  the  propagation  phases 

e  [e\2e\2] 

=E[(ol-e1p)(o2s-o2p )] 

(7.31a) 

_ tt'  \r\lr\2  /q  1  /a2  n2nl  ,  nln2~\ 

-lj  \y su s  &sVp  VsUp  +  c/pt/pj 

(7.31b) 

=  E  [0l02\  -  E  [ 0\0l ]  -  E  [0s20i]  +  E  [0y;] 

(7.31c) 
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Assuming  statistical  independence  of  orthogonal  polarizations,  E  \9\6^\  =  0  and  E  \02s9^\  = 
0  resulting  in 

EM=E[^s2]+E[«X]  (7.32) 

which  when  E  [9j92]  =  E  [9p92]  is  twice  the  average  of  the  phase  correlation  for  each 
polarization.  If  the  two  polarizations  have  similar  reflectivities  the  correlations  will  be  very 
similar,  and  the  polarization  phase  correlation  will  closely  match  the  true  phase  correlation. 

Given  the  relationship  between  polarization  phase  correlation  and  propagation  phase 
correlation,  and  that  the  polarization  phase  correlation  is  easily  estimated  from  measured 
data,  the  next  logical  step  is  to  relate  the  phase  correlation  to  the  field  correlation.  Given 
the  two-point  joint  phase  probability  distribution  function  of  (3.45)  the  correlation  between 
the  phases  9 i  and  92  at  points  Pi  and  P2  is  given  by 

E  [9i92]  =  I 0i92  (/Jsin"1  /?  +  ^  +  y/\  -  (32^  d9^92  (  7.33) 

where  fic  is  the  complex  correlation  coefficient,  fv  =  |/zc|,  (3  =  /iCOs(#2  —  9\  +  fi),  and 

=  Z/rc-  Unfortunately,  (7.33)  does  not  lead  to  a  closed-form  relationship  between  E  [9\92] 
and  fic.  Numerical  evaluation  of  (7.33)  over  the  range  of  /r  and  results  in  Fig.  7.1,  which 
shows  the  predicted  strong  dependence.  However,  it  can  be  seen  that  the  correlation  is 
not  invertible,  having  multiple  possible  values  of  fic  for  a  given  value  of  P[9i92].  As  a 
consequence,  direct  estimation  of  fic  or  equivalently  T  from  polarization  data  using  this 
approach  is  not  possible. 

7.1.3  Iterative  Refinement.  Given  that  there  are  two  unknowns  in  the  statistical 
model,  a  common  approach  to  generating  an  estimate  is  to  optimize  the  first  unknown  given 
a  current  estimate  of  the  second,  then  update  the  second  based  on  the  results  of  the  first. 
In  a  previous  implementation  developed  at  the  Air  Force  Research  Laboratory  (AFRL)  [20] 
this  method  was  used  to  iteratively  refine  estimates  for  the  common  phase  term  and  the 
correlation  matrix.  Given  an  initial  estimate  for  the  correlation  matrix,  an  estimate  for  the 
phasor  D  is  formed  then  used  with  the  remaining  data  to  develop  a  better  estimate  for  the 
correlation  matrix.  This  process  is  then  iterated  in  an  attempt  to  converge  to  the  optimal 
solution. 
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Figure  7.1:  Functional  dependence  of  E  [6\ 02]  on  the  magnitude  and  phase  of  the  field 

correlation  jic. 


7.1.4  EM- Algorithm  Correlation  Matrix  Estimation.  A  more  mathematically  rig¬ 
orous  approach  than  alternating  estimation  of  the  two  unknowns  is  to  develop  an  EM- 
algorithm  in  accordance  with  section  2.1.4.  The  EM  algorithm  is  explicitly  designed  to  deal 
with  estimation  of  parameters  when  there  is  some  form  of  unobservable  data;  and  has  been 
shown  to  converge  to  a  local  maximum  in  general,  and  to  the  true  maximum  likelihood 
estimate  if  the  likelihood  function  is  convex  over  the  space  of  possible  solutions  [15] .  In  the 
problem  of  interest,  the  correlation  matrices  Ts  and  rp  are  the  parameters  to  be  estimated, 
the  common  phase/phasor  $  is  the  unobservable  or  hidden  data,  and  As,  Ap,  and  6 12  are  the 
incomplete  or  measured  data.  In  contrast  to  the  AFRL  implementation,  7}  is  not  estimated. 
Rather,  estimates  of  the  parameters  1%  and  rp  are  developed  using  the  measured  data  and 
the  statistical  description  of  the  system. 


In  this  case,  the  complete  data  are  described  by  the  joint-density  function  of  (7.12), 


or 


p{6s,As,Ap,e12\rs,rp)  = 


which  leads  to  a  log-likelihood  function  given  by 


exp<j  --/Mil 


(7.34) 


T(rs,rp|tf,  As,  Ap,0i2)  = 


(7.35) 
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where  as  in  (7.11) 


(7.36) 


m  =  Af  r^A,  +  Af  r;1  Ap 

and  where  As  and  Ap  are  as  defined  in  (7.9).  The  conditional  expectation  over  which  the 
maximization  is  performed  is  then  given  by 


Q(M,MW)  =  E  L(rs,rp|M)|M(i) 


=  -E 


0hM0|MW 


(7.37a) 

(7.37b) 

(7.37c) 


J&e® 

where  0  is  the  space  of  all  unit-magnitude  complex  phasors. 

To  complete  the  expectation,  the  conditional  probability  distribution  p(i?|MW)  must 
be  known.  Using  Bayes’  rule,  this  conditional  distribution  is  given  by 


p(0»|M«) 


p(9s,  As,  Ap,  0i2|Ts,  Tp) 
p(As,  Ap,  ^?i2 1  r  s ,  rp) 


(7.38) 


however,  the  marginal  distribution  p{As,  Api9\2\T s,Tp)  for  this  problem  is  not  easily  ob¬ 
tainable.  As  was  previously  mentioned,  because  the  marginal  distribution  used  for  the 
expectation  is  not  a  function  of  the  updated  parameter  estimate,  it’s  effectively  constant 
and  the  expectation  can  be  performed  using  p(j9,  As,  Ap,  6?i2 |TS,  Tp).  Applying  this  and 
dropping  the  constant  multiplier  (with  the  exception  of  the  negative  sign),  the  expectation 
becomes 

Q( M,M(i))  =  -  f  'i?i/Mtfexp{--tfHMW'd}d'd  (7.39) 

J&e®  l  2  J 

Continuing  to  the  maximization  step  of  the  EM  algorithm,  the  updated  estimate  for  M  is 
then  given  by 

M(i+1)  =  argmaxQ(M,Mw)  (7.40) 

M 


7.1.5  Sparse  Solution  Approach.  In  order  to  implement  this  EM- algorithm,  the 
N  x  N  data  set  must  be  re-ordered  into  an  N 2  x  1  vector,  and  then  the  N 2  x  N 2  correlation 
matrix  must  be  formed.  For  large  images  this  can  result  in  a  problem  that  is  too  large  to  solve 
directly.  For  example,  a  256  x  256  image  would  lead  to  a  65536  x  65536  correlation  matrix 
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requiring  32 Gb  of  memory  to  store  in  64-bit  double-precision  floating  point  format.  Given 
that  the  addressable  memory  on  a  32-bit  processor  is  capped  at  4 Gb,  the  required  space  for 
simply  storing  the  correlation  matrix  puts  the  problem  out  of  the  reach  of  many  currently 
available  computers.  Additionally,  the  computation  time  involved  in  directly  operating  on 
such  a  large  matrix  would  be  significant. 

Based  on  the  definition  of  T,  the  correlation  matrix  takes  on  a  Toeplitz-block-Toeplitz 
form.  Furthermore,  because  the  magnitude  of  the  correlation  depends  only  on  the  absolute 
separation  between  the  points,  the  magnitude  of  each  block  will  be  identical  with  only  a 
known  phase  difference  between  them.  This  introduces  a  large  amount  of  redundancy  that 
can  substantially  reduce  the  storage  requirements  for  T.  If  the  solution  algorithm  can  be 
customized  to  take  advantage  of  the  structure  and  redundancy  in  the  matrix,  the  storage 
and  computational  requirements  can  be  reduced  significantly. 

One  approach  to  utilizing  this  redundancy  is  to  pose  the  problem  as  sets  of  two  point 
data  pairs  characterized  by  a  common  point  separation.  This  can  be  done  without  loss  in 
generality  because  of  the  Gaussian  statistics  which  are  completely  characterized  by  their 
first  and  second  moments.  An  estimator  can  be  built  for  each  point  separation,  and  used  to 
incrementally  build  up  an  estimate  for  the  full  correlation  matrix.  To  do  this,  the  statistical 
model  is  modified  to  only  include  the  joint  two-point  distribution  of  intensity  and  phase 
given  by  (3.43)  re-written  here  as 


P(h,i2,e1,e2) 


i 

4/27T2(l-^2)eXP 


h  +  h  -  2y/hhn cos  (6\  —  62  + ip) 
/(1-M2) 


(7.41) 


where  ju  =  |//c|,  ip  =  Z/rc,  and  /  is  the  average  intensity  in  the  detector  plane. 

Assuming  independence  of  orthogonal  polarizations  and  defining  the  fields  along  the 
s-  and  p-polarizations  as 


U„  =Af 


JSs 


(7.42a) 


the  joint  pdf  for  any  set  of  data  pairs  from  each  polarization  is  given  by 


P(ilJl,ixvjlelele\2e 


12  J 


exp 


1 

(4tt2IsIp)2(1  -  (ps)2)(l  -  (fip)2)  X 
I,1  +  /2  -  2  v/ipf/x*  cos  (0]  -  6l  +  r) 

U  i  -  (/u)2) 


Ilp  +  ip  —  2 cos  (el  -  e\2  -  el  +  e\2  +  r) ' 
ipO- —  (/^p)2) 


(7.43) 


Point  pairs  with  the  same  separation  are  treated  independently,  and  the  joint  distribution 
for  N  point  pairs  with  set  separations  is  given  by 


>  %  » -?■?  >  ^ ^  >#12 ,  ^12)  =  H P(J«  (n)  ’  (n) »  Tp  W >  7p ' W >  W »  0?  (n) .  #12  (n)  6'l2  («)  ) 


(7.44) 

This  distribution  can  then  be  used  to  develop  a  more  tractable  EM-algorithm  for  estimation 
of  pc.  To  simplify  development  of  an  EM-algorithm  for  this  distribution,  only  one  point 
pair  is  used  at  present.  The  result  will  be  expanded  later  to  include  multiple  pairs. 


Using  the  joint  PDF  of  (7.43)  and  throwing  out  terms  that  are  not  functions  of  the 
correlation  coefficients,  the  CD  log-likelihood  function  is  given  by 


T(^,^)  =  -ln{(l-(^)2)(l-(Mp)2)}- 


Ij  + 12  -  2  y/T^fis  cos  (el  -  el  +  r) 


Is{i  -  Ou)2) 

$  +  Ip-‘2  sJ^Jjep  cos  (el  +  e\2  -e2s-  e\2  +  v) 

^>(1 —  (/^p)2) 


(7.45) 


To  carry  out  the  expectation  of  (2.16),  the  conditional  PDF  for  dl,  and  02,  given  ij,  /2,  Ip, 
Ip,  0\2,  and  6f2  must  be  found  using  Bayes’  rule  such  that 


p(el,el\il,il,il,i 


^12  ? ^12) 


p(il,ilii,r2el,ele\2e2 


12) 


p(il,ih  ip,  ip,  e\2e\ 


(7.46) 


12 ) 


which  requires  the  joint  marginal  distribution  of  the  known  parameters  I l ,  I2,  Ip,  Ip,  9\2, 
and  e\2. 
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Because  the  fields  are  independent,  the  marginal  distributions  for  each  polarization 
will  also  be  independent  and  can  be  found  individually  and  multiplied  to  find  the  desired 
quantity  rather  than  attempting  to  integrate  (7.43)  directly.  Doing  this  and  temporarily 
ignoring  the  constants  in  front  of  the  exponential  in  (7.41),  the  desired  marginal  distributions 
are  given  by 


and 


/£W=wyexp 


/f  +  /f 


2  \/Whis cos  i°l  -  7  +  v) 

U  i  -  (Ps)2) 


de]M 

(7.47) 


P(IpiIpiO  12)  ^12)  — 


exp 


r  r  1 

JL  472tt2(1  -  (fj,p)2)  X 

IP1  +  IP2  -  2  VW2PV  cos  (^  -  o2  +  0\2 


7p(l  (Mp)2) 


012  +  r) 


del&ol 


(7.48) 


Using  the  results  of  Equation  (2.94)  in  [12],  these  integrals  are  the  same,  and  ultimately 
have  no  dependence  on  0\2  and  022,  with  the  final  result  for  the  joint-marginal  PDF  given 
by 


p{Ils,llllll0\2922)  = 


exp 


(Is)2(/p)2(1-(Ms)2)(1-(^)2) 


1 1  + 12  + 1[  + 12 

7jp(l  -  (/r,)2)(l  -  (^p)2) 


Io 


4(1  -  (Ps)2) 


Io 


( \ 

(4(1  -  M2) ) 


(7.49) 


where  Jo(-)  is  a  modified  Bessel  function  of  the  first  kind,  zero  order. 

Because  the  conditional  distribution  is  conditioned  on  the  current  estimate  for  the 
parameter  /rc  in  the  expectation  step,  this  marginal  distribution  is  a  constant,  and  can  be 
ignored.  The  expectation  can  be  taken  using  the  complete  joint-probability  given  by  (7.43) 
such  that 


Q(Pc  /A 


(old)\  _ 


L(pMel,  0'iJl  i2, 1L12  e\2,  o22\^oldld0lM 


(7.50) 
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Applying  (7.43)  and  (2.15)  to  (7.50),  assuming  for  simplicity  that  the  correlation 
coefficients  for  each  polarization  are  the  same,  and  pulling  all  factors  that  are  independent 
of  the  phase  terms  9\  and  02  out  of  the  integral,  the  expectation  is  then  given  by 

Q(Mc,/4oW))  =  /i(m)  +  /2(m^(oW))/3W  (7.51) 


where  the  term  of  current  interest  is 


MVO  = 


cos  (el  -  e2s  +V0  +  Jiffi  cos  (el  -  e2  +  o\2  -  o\2  +  v>) 


exp 


^jij2cos(el-e2  +  ^)+ 


i^cos(e1s-e2  +  e112-e22  +  ^)  d^2  (7.52) 


Because  f±  and  fi  are  everywhere  nonnegative,  (7.51)  can  be  maximized  over  the  space  of 
possible  correlation  coefficients  by  maximizing  fa  with  respect  to  -0  first,  then  using  this 
result  in  (7.51)  and  maximizing  over  ju.  To  expand  the  problem  to  include  multiple  data 
pairs,  independence  of  point  pairs  is  assumed,  and  the  integral  of  becomes 


/'(0l,0l,^)exPr/'(0l,@l,^) 


dflfdflf  (7.53) 


J0K i)  JOHN)  J0*(1)  J0*{N) 

where  there  are  twice  as  many  integrals  as  there  are  point  pairs  with  the  specified  separation, 


N 

n=  1 

\Jlpin)Ipin)  cos  (elin)  -6l{n)  +  0[2{n)  -  0?2(ra)  + 


(7.54) 


T* ,  I~,  Ip ,  if,  6\ 2,  and  fff2  are  vectors  containing  the  polarization  phase  for  all  point 
pairs  with  a  given  separation,  and  the  integration  is  over  all  the  unknown  phase  variables. 

Because  this  integral  has  no  known  solution,  numerical  integration  was  considered. 
However,  assuming  for  simplicity  that  the  nested  integrals  could  be  separated,  a  20  frame 
64  x  64  pixel  image  maximization  of  (7.51)  at  all  point  pair  separations  would  require 
approximately  1.3  xlO9  numeric  integrations  per  iteration.  This  also  assumes  that  the  result 
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of  (7.51)  has  a  well  defined  maximum  on  the  interval  (—7 r  <  ij)  <  ir\  which  may  or  may  not 
be  the  case.  Because  of  the  extreme  cost  of  forming  this  estimate,  further  work  on  an  EM 
algorithm  for  PP  polarization  data  was  halted  at  this  point. 


7.2  lensless  APDI 

Another  approach  to  reconstructing  images  from  PP  polarization  state  data  is  to 
mathematically  shape  the  problem  to  look  like  a  phase-diversity  (PD)  problem  and  generate 
a  lensless  APDI  (LAPDI)  algorithm.  Recall  that  the  fields  in  the  system  PP  immediately 
after  the  effective  lens  for  a  PD  system  were  described  by 


UFP  =  t/0  ex p  [j(, 

UDp  =  U0  exp  [j  (cj>  +  4>d)\ 


(7.55a) 

(7.55b) 


where  U0  =  A  exp  [j 6 \\.  0\  is  the  vacuum  propagation  phase,  (f>  is  the  unknown  aberration 
phase  and  (f)p  is  the  known  diversity  phase.  If  all  data  is  taken  in  the  PP,  the  system  is 
insensitive  to  the  aberration  phase,  and  the  aberration  phase  can  be  arbitrarily  set  such 
that  cj)  =  —  9S.  In  this  case,  the  PP  field  could  be  written 


Upp  =  A 

UDP  =  A  exp  \j(pn\ 


(7.56a) 

(7.56b) 


The  FP  and  DP  intensities  would  then  be  given  by 

dFP(x\F{A}\2  (7.57a) 

dDP  oc  |JP{Aexp  [j  (4>d)]}\2  (7.57b) 

Allowing  the  amplitudes  of  (7.56)  to  have  channel- dependent  noise,  (7.56)  becomes 

UFp  =  As  (7.58a) 

UDp  =  Ap  exp  \jcj)D\  (7.58b) 
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Comparing  (7.58)  and  (7.2),  the  data  available  from  PP  polarization  state  field  can 
completely  describe  the  field  if  we  let  4>d  =  @12-  The  field  description  for  each  polarization 
is  simply  the  square-root  of  the  intensity.  The  operations  in  (7.57)  required  to  generate  an 
aberrated  image  can  then  be  mathematically  performed,  and  the  APDI  algorithm  can  be 
modified  to  work  with  only  polarization  data.  The  updated  noiseless  model  data  is  given 
by 

9Sm  =  0®sL  (7-59) 

where  m  E  { FP ,  DP},  and 

sfFP  =  |.F{exp  [-j(j>]} |2  (7.60a) 

sdp  =  |b^{exP  [— J  (0  +  6»12)]}|  (7.60b) 


The  “measured  data”  for  the  FP  and  DP  channels  is  defined  as 


dfFP{x,y )  =71  P{Afs} 
dfDP{x,y )  =72  HAl} 


(7.61a) 

(7.61b) 


where  71  and  72  are  normalization  constants,  Al  and  Ap  are  the  measured  field  amplitudes 
for  the  principle  polarizations  in  the  fth  frame,  and  0(2  is  the  polarization  phase  for  the  fth 
frame.  To  account  for  the  non-zero  average  of  As  and  Ap,  it  is  subtracted  out  before  the 
model  images  are  formed.  Another  notable  change  is  that  the  “diversity  phase”  is  now  a 
part  of  the  data  set,  and  changes  with  each  frame.  The  modulus  squared  of  A{  and  Ap  can 
be  passed  into  the  APDI  PP-regularization  term  without  additional  changes. 

The  only  major  change  to  algorithm  development  is  modification  of  the  aberration 
parameterization.  Because  the  aberrations  being  considered  for  the  APDI  algorithm  were 
somewhat  smooth  and  had  a  power  spectrum  that  was  predictable,  a  reduced  representation 
for  them  could  be  easily  found  and  the  aberration  represented  with  only  a  few  parameters. 
However,  when  the  unknown  phase  is  the  vacuum  propagation  phase,  the  complex  structure 
of  the  phase  makes  projecting  it  against  Zernike  modes  counterproductive.  A  suitable 
representation  would  require  roughly  as  many  parameters  as  there  are  pixels,  and  numerical 
roundoff  and  aliasing  of  the  high-order  modes  would  adversely  impact  the  representation. 
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As  an  alternative,  the  natural  basis  set  (shifted  5-functions)  is  used,  making  each  phase  pixel 
in  each  frame  a  variable  of  optimization.  Other  than  expanding  the  variable  space,  the  only 
change  required  is  in  the  computation  of  the  objective  function  gradient  with  respect  to 
phase  parameters. 

Restating  (5.18)  and  (5.19)  for  the  partial  derivative  of  gm  with  respect  to  phase 
parameter  (j>(uQ,vo)  and  substituting  -i/>n  =  5(u  —  uq,  v  —  vq), 

=  o(x,y)®»t(x,y)  (7.62) 

where 


l{x,  y)  =  2%  j h*m(x,  y)F^j5{u  -u0,v-  v0)P(u ,  v )  x 

j(y^APm<j>p(u,v)  +  <t>{u,v))  8(u-uq,v-  vp))  |  (7.63) 


exp 


(3m  =  1  when  m  =  DP  and  0  otherwise,  and  the  aberration  phase  has  not  been  param¬ 
eterized.  Assuming  P(u,  v)  =  1  for  all  points  in  the  data  array,  using  the  definition  for 
the  discrete  Fourier  transform,  and  employing  the  sifting  property  of  the  5-function,  (7.63) 
becomes 


—  ( u0x  +  v0y )  +  f3Dcf>D(uo,  v0)  +  </>(u0,  u0) 

(7.64) 

One  major  consequence  of  shifting  from  a  truncated  Zernike  mode  representation  to 
a  pixel- value  representation  for  phase  is  that  the  computational  costs  skyrocket.  For  the 
original  APDI  implementation,  each  iteration  requires  (6M  +  12)  F  +  5  FFTs,  where  there 
are  F  data  frames,  and  M  Zernike  modes  are  compensated.  This  translates  to  3065  FFTs 
per  iteration  for  a  30-frame  data  set  of  any  size  with  15  modes  per  frame.  By  contrast,  the 
LAPDI  algorithm  requires  (41V2  +  12 )F  +  5  FFTs  for  each  iteration  using  an  N  x  N  data 
set,  translating  to  roughly  5  x10s  FFTs  per  iteration  for  a  30-frame  64  x  64  pixel  data  set, 
and  2xl06  for  a  128  x  128  pixel  30-frame  data  set. 

Because  of  the  extreme  computational  cost,  a  16  x  16  test  image,  shown  in  Fig.  7.2(a), 
was  used  for  initial  algorithm  performance  analysis.  The  reconstruction  shown  in  Fig.  7.2(b) 


Sm{x,y)  =  Jj2lm\  (h*m(x,y))*  exp\  -j 
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resulted  from  using  the  truth  for  the  starting  guess,  and  converged  after  six  iterations.  Given 
the  inherently  noisy  data,  the  reconstruction  is  quite  good.  However,  when  starting  with  a 
random  guess,  the  algorithm  stalled  at  a  local  maximum  after  one  iteration.  The  resulting 
reconstruction  bore  no  resemblance  to  the  original  object.  To  get  a  better  starting  guess, 
the  polarization  phase  PSD  was  used  for  the  truth  object  and  uniform  random  numbers 
were  generated  for  the  phase  guess.  Again,  the  optimization  stalled  after  one  iteration.  The 
starting  guess  and  reconstruction  are  shown  in  Fig.  7.3(a)  and  Fig.  7.3(b)  respectively. 


Figure  7.2:  LAPDI  (a)  Truth  object /initial  guess,  and  (b)  reconstruction.  The  algorithm 
converged  after  6  iterations. 


2  4  6  8  10  12  14  16 


(a)  Guess 


2  4  6  8  10  12  14  16 

(b)  Reconstruction 


Figure  7.3:  LAPDI  (a)  Initial  guess  from  0\2  PSD,  and  (b)  reconstruction.  The  algorithm 
stalled  after  1  iteration. 
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The  failure  to  converge  is  likely  due  to  the  fact  that  there  are  only  3FN2  data  points 
for  (F  +  1)N2  unknowns,  a  ratio  of  slightly  less  than  three  to  one  for  any  realistic  N 
and  F.  While  this  should  be  sufficient  in  a  nearly  noiseless  system,  speckle  noise  caused 
by  laser  statistics  creates  local  minima  that  trap  the  optimization  routine.  Because  each 
additional  frame  introduces  N2  additional  unknowns  while  only  adding  3N2  data  points,  the 
problem  won’t  be  significantly  better  conditioned  by  simply  adding  frames.  An  alternative 
approach  must  be  taken  to  regularize  the  objective  if  this  approach  is  to  work.  Because  of 
its  prohibitively  expensive  nature  and  failure  to  converge  to  satisfactory  solutions,  further 
work  on  the  LAPDI  algorithm  was  terminated. 


7.3  Polarization  Enhanced  AD  PI 


An  implication  of  the  relationship  between  the  polarization  phase  correlation,  phase 
correlation,  (3.35)  and  Fig.  7.1,  is  that  there  is  a  large  amount  of  information  about  the 
object  encoded  in  the  polarization  phase  measurements.  An  alternative  approach  to  incor¬ 
porating  this  information  into  an  image  reconstruction  algorithm  is  to  replace  the  PP  sensor 
in  the  APDI  system  described  in  Chapter  V  with  an  imaging  polarimeter.  The  intensities 
for  the  two  polarizations  can  then  be  used  as  PP  intensities  in  the  APDI  algorithm  without 
modification.  The  remaining  step  is  to  form  a  statistical  description  for  the  polarization 
phase  that  will  fit  in  the  framework  of  the  APDI  algorithm. 

Rather  than  work  directly  with  the  (unknown)  joint  PDF  of  the  polarization  phase, 
the  spatial  PSD  of  the  polarization  phase  is  used.  Though  there  is  no  known  analytic 
expression  for  the  distributions  of  the  PSD  or  autocorrelation,  the  definition  of  the  PSD  as 
the  modulus  square  of  the  Fourier  transform  can  be  used  to  develop  an  approximate  form. 
As  before,  the  Fourier  transform  is  written  in  integral  form,  and  the  power  spectrum  of  the 
polarization  phase  is  given  by 


di2  (x,y)  = 


$i2(tt,  v)  exp  [—j 2ir  (ux  +  vy)\  dudu 


(7.65) 


Assuming  for  simplicity  that  the  9 12  values  are  independent  and  viewing  the  integral  as  an 
infinite  sum  of  randomly  weighted  unit-magnitude  complex  phasors,  the  central  limit  theo¬ 
rem  indicates  that  the  integral  will  be  a  complex  Gaussian  random  variable.  Applying  the 
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same  transformations  used  for  (3.28),  the  power  spectrum  will  be  exponentially  distributed. 
As  with  the  PP-PSD  data,  a  histographic  analysis  agrees  with  exponential  statistics.  To 
maintain  tractability,  spatial  independence  is  again  assumed. 

As  an  indication  of  the  object  information  contained  in  d\2,  the  truth  object  and  100 
averaged  simulation  realizations  are  shown  in  Fig.  7.4.  To  relate  d±2  to  the  object  being 
imaged,  define 

//  (A£,  Art)  =  E  [6»i2  (£i,  771)  #12  (6,  m)]  (7-66) 

where  E  [6*12  (£1, 771)  $12  (£2,  772)]  is  a  function  of  fi  and  ij),  and  is  shown  in  Fig.  7.1.  Applying 
the  Weiner-Khintchine  theorem  to  relate  the  autocorrelation  and  the  PSD  such  that  the 
noiseless  model  data  is  given  by 


012  (x,  y)  =  T{y!  (A£,  At?)} 


(7.67) 


where  7/  is  as  given  in  (7.66).  As  previously  stated,  /ic  is  a  normalized  Fourier  transform  of 


the  object  given  by 


and  /i  =  \nc\  and  ^ 


_  F{o(x,y)} 

“c  £{,„•>  (f.i) 


(7.68) 


(a)  Truth  (b)  Average  PSD 

Figure  7.4:  (a)  Truth  object,  and  (b)  100  averaged  polarization  phase  PSD  realizations. 

It’s  clear  that  a  large  amount  of  object  information  is  encoded  in  the  polarization  phase 
PSD. 
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Statistical  independence  between  data  frames  is  assumed,  and  the  joint  distribution 


for  the  polarization  phase  PSD  is  given  by 


1  d12  ( x,y )' 

.912  (x,  y)  gv2{x,y)_ 


(7.69) 


leading  to  a  log-likelihood  given  by 


(7.70) 


The  only  remaining  pieces  required  for  incorporation  into  the  APDI  algorithm  are  develop¬ 


ment  of  a  method  for  efficiently  determining  g'  (/u.,  -0) ,  and  computation  of  the  gradient  of 
gi2  with  respect  to  the  object  pixels  and  aberration  parameters.  Because  the  polarization 


phase  is  insensitive  to  the  aberration,  the  partial  derivative  with  respect  to  the  aberration 
parameters  is  zero. 

Initial  attempts  to  fit  g'  (g,  ip)  to  analytical  expressions  including  Gaussian,  Lorentzian, 
and  cosine  series  didn’t  produce  a  good  fit.  Furthermore,  evaluation  of  the  partial  deriva¬ 
tives  of  the  fit  were  unacceptably  computationally  expensive.  As  an  alternative,  g!  and  its 
partial  derivatives  with  respect  to  g  and  ip  were  numerically  evaluated  by  integrating  (3.45) 
and  its  derivatives  on  a  linearly  spaced  256  x  256  grid  covering  the  ranges  (—7 r  <  ip  <  n) 
and  (0  <  g  <  1).  The  results  of  the  numeric  integration  are  shown  in  Fig.  7.5.  The  map¬ 
ping  between  gc  and  g' ,  and  the  gradients  of  g!  with  respect  to  g  and  i p  were  obtained  by 
performing  a  2-D  linear  interpolation  using  nearest  neighbor  points. 

The  remaining  pieces  required  for  integration  are  the  partial  derivatives  of  g  and  ip 
with  respect  to  a  given  object  pixel.  Using  the  definition  of  the  discrete  Fourier  transform, 


(7.71) 


and  defining 


v-  =  0*/u)1/2 


(7.72) 
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¥  (rad)  0  (1  ¥  (rad)  0  ^  ¥  (rad) 


(a)//  (y.VO  (b)  (M,V9  (c  )  5^m'(m>V0 

Figure  7.5:  Numerical  evaluation  of  //  (/x, t/>),  dy' /dy,  and  dy’ /ip.  2-D  linear  interpolation 
is  used  for  /x  and  ip  values  that  don’t  correspond  to  grid  points. 


and 


ip  =  tan 


-l  ( Im{Vc} 


^3<e{yc} 

the  partial  derivatives  of  y  and  ip  with  respect  to  object  pixel  o  ( xa ,  yQ )  are  given  by 


(7.73) 


cos  (  ^  ( x0x  +  y0y)  +  ip  (x,  y)  )  -  y  (x,  y) 


and 


dy  (x,  y)  =  1 

do  (x0,  y0)  °  (f  > r?)  L~“  V  ^ 


dip(x,y)  _  1  -sin  {x0x  +  y0y)  +  ip{x,y)) 


(7.74) 


(7.75) 


do  (xo,  y0 )  E^0^*7?)  /^(a:,y) 

Finally,  to  produce  the  required  partial  derivatives  (7.74)  and  (7.75)  were  passed  through  a 
Fourier  transform  such  that 


<9.912  (x,  y)  =  T  j  <V  dy  +  3//'  dp) 


do(x0,yo)  \dydo(x0,y0)  dip  do  (x0,  y0) 


(7.76) 


This  regularization  term  was  coded  and  included  in  a  modified  Polarization  Enhanced 
(POLE)  APDI  algorithm  using  only  PP  and  FP  measurements.  Because  the  regularization 
term  requires  0(N2)  computations  per  iteration  whereas  the  remaining  terms  are  computed 
using  FFTs  with  0(N  log  N)  computations,  the  regularization  is  rather  expensive.  To  de¬ 
termine  if  the  cost  of  this  regularization  term  is  worthy  of  further  investigation,  100  sets  of 
30-frame  data  were  generated  with  a  D/ro  =  15  and  assuming  perfect  detection,  then  pro- 
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cessed  with  and  without  polarization  phase  regularization  and  without  DP  data.  Results  of 
the  simulations  are  shown  in  Table  7.1.  Introduction  of  the  polarization  phase  term  causes 
a  significant  increase  in  both  computation  time  and  MSE. 

Table  7.1:  Performance  comparison  between  POLE  and  FP-PP  APDI 


Metric 

POLE  APDI 

FP-PP  APDI 

MSE 

4.1032 

1.7865 

Time  (s) 

5523.2 

304.37 

Function  Eval’s 

181 

367 

Even  though  it  appears  that  much  information  is  contained  in  the  regularization  term, 
this  implementation  has  an  overall  negative  effect.  The  approximations  made  for  the  func¬ 
tional  form  of  fjf  and  its  gradients  likely  caused  deviations  from  the  desired  behavior.  If 
this  form  of  regularization  is  to  be  used,  a  better  statistical  relationship  between  the  object 
and  the  polarization  phase  must  be  found.  If  an  inexpensive  analytic  solution  were  found, 
the  error  caused  by  approximation  would  be  reduced  and  computational  burden  could  be 
less  of  a  factor.  However,  because  the  APDI  algorithm  performs  exceptionally  well  as  is, 
the  trade-off  in  signal  strength  implied  by  addition  of  a  4-channel  detector  in  the  PP  would 
likely  drown  out  any  benefit  gained. 

7-4  Future  Directions 

In  order  to  reconstruct  images  from  only  PP  polarization  data,  an  alternative  approach 
must  be  found.  The  computational  burden  associated  with  the  methods  developed  here 
make  them  impractical.  Ideally,  the  EM-algorithm  approach  should  be  simplified  and  made 
tractable.  There  is  likely  remaining  structure  and  redundancy  in  the  problem  that  can  be 
used  to  cut  down  the  costs  and  make  it  a  practical  alternative. 

For  the  LAPDI  algorithm,  a  possible  avenue  for  progression  might  be  to  investigate 
the  ability  of  alternative  basis  sets  to  efficiently  represent  the  propagation  phase  with  fewer 
terms.  Looking  at  things  like  JPEG  compression  techniques,  Huffman  coding,  or  other  sim¬ 
ilar  techniques  to  represent  the  phase  with  fewer  terms  might  help  to  reduce  the  number  of 
phase  variables.  If  the  number  of  problem  unknowns  can  be  reduced,  the  LAPDI  approach 
will  become  better  conditioned,  less  costly,  and  more  likely  to  converge  to  an  acceptable 
solution.  In  addition  to  considering  additional  basis  sets,  introduction  of  auxiliary  informa- 
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tion  about  the  object  such  as  physical  support  like  that  done  for  correlography  [17]  might 
provide  the  extra  information  needed  to  lead  to  proper  convergence. 

Finally,  if  an  inexpensive  and  accurate  model  for  the  polarization  phase  PSD  and  its 
gradients  can  be  found,  inclusion  of  PP  data  in  the  POLE  algorithm  might  produce  better 
results.  To  evaluate  the  possibility  of  improved  reconstructions  with  a  better  model,  the 
numeric  integrations  used  to  produce  the  mapping  and  gradients  shown  in  Fig.  7.5  should 
be  evaluated  on  a  much  finer  mesh  and  with  tighter  tolerance  on  the  numeric  integration. 
This  was  attempted,  but  was  aborted  due  to  time  constraints  and  equipment  malfunctions. 
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VIII.  Conclusions 


Coherence  properties  of  laser  illumination  were  used  to  develop  an  algorithm  for  produc¬ 
ing  imagery  of  objects  using  laser  illumination.  A  maximum-likelihood  multi-frame 
active  phase-diversity  imaging  algorithm  was  derived  for  coherent  statistics  induced  as  a 
consequence  of  active  illumination.  A  statistical  model  for  object  information  encoded  in 
the  pupil-plane  intensity  distribution  was  developed  and  incorporated  into  the  derivation. 
The  resulting  algorithm  was  characterized  by  use  of  Monte  Carlo  simulations  and  shown  to 
perform  exceptionally  well.  Pupil-plane  data  was  shown  to  have  a  dramatic  positive  impact 
on  algorithm  performance  and  be  a  major  contributor  to  overall  system  performance.  Pro¬ 
vided  that  the  algorithm  functions  as  well  operationally  as  it  does  in  simulation,  it  will  be 
a  unique  and  invaluable  tool  for  imaging  satellites  or  other  space-based  objects  obscured  by 
the  earth’s  shadow  and  will  expand  overall  Space  Situational  Awareness. 

In  addition  to  developing  a  functioning  algorithm  for  use  with  conventional  measure¬ 
ment  techniques,  approaches  for  using  polarization  data  were  developed  and  presented; 
establishing  a  baseline  for  future  work.  The  groundwork  for  an  EM  algorithm  to  directly  es¬ 
timate  the  object  from  polarization  measurements  was  laid,  but  proved  too  cumbersome  to 
implement.  The  APDI  algorithm  was  modified  to  process  polarization  state  data  indepen¬ 
dent  of  other  measurements,  but  proved  too  ill-conditioned  to  produce  satisfactory  results. 
Finally,  the  APDI  algorithm  was  extended  to  utilize  polarization  state  measurements  in 
place  of  the  pupil-plane  measurements,  and  areas  ripe  for  improvement  were  identified. 

8.1  Summary  of  Results 

Monte  Carlo  analysis  of  the  APDI  algorithm  showed  that  the  impact  of  target  object, 
number  of  data  frames,  detection  SNR,  and  detector  array  size  on  the  optimal  convergence 
tolerance  is  minimal.  Under  static  aberrations,  the  optimal  convergence  tolerance  was  in¬ 
sensitive  to  variations  in  system  parameters.  For  the  dynamic  aberration  case,  the  optimal 
convergence  tolerance  was  independent  of  all  but  the  detector  array  size,  indicating  that 
minimal  characterization  is  required  for  a  specific  system  configuration. 

For  both  the  dynamic  and  static  aberration  cases,  the  optimal  conditioning  bias  was 
found  to  dramatically  impact  computation  time,  and  to  a  lesser  extent  MSE.  For  the  static 
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aberration  case,  conditioning  biases  below  10%  of  the  average  frame  intensity  resulted 
in  worse  reconstructions  and  longer  computation  times  as  the  problem  became  less  well- 
conditioned.  For  conditioning  biases  above  10%  of  the  average  frame  intensity,  the  residual 
error  climbed  steadily  and  the  computation  time  dropped  quickly  as  the  objective  function 
moved  further  away  from  the  statistically  derived  objective  function  and  became  better 
conditioned.  For  the  static  aberration  case,  the  optimal  conditioning  bias  was  found  to  be 
10%. 

For  the  dynamic  aberration  case,  anomalous  results  for  bias  values  of  40%  and  10%  are 
unexplained.  However,  the  general  trend  is  consistent  with  that  seen  in  the  static  aberration 
data  with  the  exceptions  that  the  increase  for  smaller  biases  begins  at  larger  values  and  is 
slower  than  with  the  static  aberration  case,  and  that  the  increase  in  MSE  with  increasing 
bias  is  less  pronounced.  For  the  dynamic  aberration  case,  the  optimal  bias  was  found  to  be 
50%. 

The  APDI  algorithm  is  robust  under  a  broad  range  of  scenarios.  With  over  7,000 
realizations  each  for  static  and  dynamic  aberrations  including  realizations  with  as  few  as 
ten  data  frames  and  detection  SNR  values  as  low  as  2,  none  of  the  dynamic-aberration 
reconstructions  converged  to  an  image  that  was  worse  than  the  average  of  the  focal-plane 
data.  For  static  aberrations,  roughly  3%  of  reconstructions  resulted  in  an  MSE  that  was 
higher  for  the  reconstruction  than  the  averaged  data.  However,  in  most  of  those  cases 
a  visual  comparison  of  the  reconstruction  with  the  averaged  raw  data  showed  that  the 
reconstruction  was  significantly  better  than  the  raw  data  in  spite  of  the  poor  quality  metric. 
In  all  other  cases,  the  judgment  of  whether  or  not  a  reconstruction  was  “worse”  than  the 
raw  data  became  somewhat  subjective. 

Simulations  using  varying  detection  noise  conditions,  including  both  photon  and  ther¬ 
mally  dominated  noise  sources,  showed  that  the  algorithm  is  essentially  insensitive  to  noise 
for  SNR  values  above  approximately  7,  and  that  it  still  performs  relatively  well  for  SNR 
values  as  low  as  2.  Given  that  the  algorithm  is  designed  to  deal  with  data  that  has  a  noiseless- 
detection  SNR  of  1,  it’s  no  surprise  that  it  performs  well  under  noisy-detection/low-light 
conditions. 
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Comparing  results  from  the  PP-FP  APDI  and  DP-FP  APDI  algorithms  with  the 
complete  APDI  algorithm,  it’s  clear  that  the  PP  data  has  a  much  more  significant  impact 
on  estimator  performance  than  the  conventionally  used  DP  data.  Reconstructions  done 
without  PP  data  included  resulted  in  148%  and  54%  increases  in  average  MSE  compared 
with  the  Full  APDI  algorithm  for  the  dynamic  and  static  aberration  cases  respectively.  This 
shows  that  the  PP  data  has  a  dramatic  impact  on  overall  performance  of  the  reconstruction. 

Exclusion  of  the  DP  data  from  the  APDI  algorithm  resulted  in  only  a  11%  and  24% 
increase  in  MSE  for  the  static  and  dynamic  aberration  cases  respectively.  In  cases  where 
low-light  levels  force  a  poor  SNR,  the  marginal  gain  in  performance  obtained  by  using 
all  three  data  planes  may  be  swamped  by  the  decrease  in  performance  caused  by  a  low 
detection  SNR.  This  indicates  that  a  two-channel  system  comprised  of  a  FP  and  PP  sensor 
might  be  optimal  for  these  kinds  of  scenarios.  Even  when  SNR  isn’t  a  factor,  the  additional 
computational  burden  and  hardware  requirements  for  a  complete  three-channel  system  may 
not  be  justified  by  the  marginal  gain  in  system  performance. 

The  framework  developed  for  an  EM  algorithm  proved  too  complex  and  computa¬ 
tionally  expensive  to  be  implement  able.  Even  after  making  simplifying  assumptions,  the 
maximization  step  for  the  unknown  correlation  phase  required  0(N3)  integrations  per  point 
separation,  resulting  in  0(N 4)  numeric  integrations  per  iteration.  Without  development  of 
a  simplified  statistical  model  or  derivation  of  an  analytic  maximum,  the  EM  algorithm 
approach  is  intractable  and  impractical. 

Attempts  to  use  an  APDI-type  algorithm  to  post-process  PP  polarization  state  mea¬ 
surements  did  not  result  in  satisfactory  results.  As  implemented,  the  algorithm  is  poorly 
conditioned  and  converges  to  local  maxima  that  are  not  satisfactory  reconstructions  of  the 
object.  To  illustrate  the  poorly  conditioned  nature  of  the  problem,  F  frames  of  N  x  N  PP 
polarization  state  data  consists  of  3FN2  extremely  noisy  data  values  for  a  problem  with 
(F  +  1)  N 2  unknowns.  In  a  noiseless  world,  this  should  be  sufficient  to  solve  the  system. 
However,  in  the  presence  of  noise,  it  appears  that  more  measurements  per  unknown  are 
required  for  convergence  to  an  adequate  solution.  Because  every  additional  frame  of  N  x  N 
data  introduces  3 N2  data  points  and  an  additional  N 2  unknowns,  the  problem  will  not  be 
significantly  better  conditioned  by  simply  adding  additional  frames. 
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In  addition  to  being  ill-conditioned,  the  computational  burden  for  making  each  phase 
pixel  an  optimization  variable  is  considerable.  For  the  original  APDI  implementation,  each 
iteration  requires  (6 M  +  12)  F  +  5  FFTs,  where  M  is  the  number  of  modes  being  compen¬ 
sated.  This  translates  to  to  3065  FFTs  per  iteration  for  a  30-frame  data  set  of  any  size  with 
15  modes  per  frame.  By  contrast,  the  LAPDI  algorithm  requires  (4 N2  +  12)F  +  5  FFTs  for 
each  iteration,  translating  to  roughly  5xl05  FFTs  per  iteration  for  a  30- frame  64  x  64  pixel 
data  set,  and  2xl06  for  a  30-frame  128  x  128  pixel  data  set.  If  this  approach  is  to  work,  a 
condensed  form  for  the  unknown  phase,  similar  to  the  Zernike  representation  used  for  the 
unknown  turbulence  phase  in  the  APDI  algorithm,  must  be  developed  to  reduce  the  param¬ 
eter  space  and  allow  for  a  better  conditioned  problem.  Alternatively,  additional  auxiliary 
information  about  the  object  being  reconstructed  must  be  included  in  the  algorithm. 

While  the  original  APDI  algorithm  was  demonstrated  to  produce  good  results,  inclu¬ 
sion  of  PP  polarization  data  did  not  result  in  better  performance.  Because  of  mathematical 
difficulties  with  the  statistical  description  of  the  polarization  data,  approximations  were 
made  which  adversely  impacted  estimator  performance.  Addition  of  the  polarization  data 
to  the  estimator  resulted  in  fewer  function  evaluations,  but  the  additional  computational 
burden  more  than  counterbalanced  this  improvement.  Furthermore,  the  resulting  recon¬ 
structions  were  significantly  worse  than  those  obtained  using  the  original  APDI  algorithm. 
If  polarization  data  is  to  be  used  in  place  of  PP  intensity  measurements  in  the  APDI  al¬ 
gorithm,  a  better  statistical  description  and/or  a  formulation  with  a  smaller  computational 
burden  must  be  developed  and  incorporated  into  the  estimator. 

8.2  Significant  Contributions 

The  following  items  summarize  the  significant  contributions,  extensions,  and  develop¬ 
ments  made  as  a  result  of  this  investigation: 

1.  The  first  phase  diversity  algorithm  utilizing  PP  intensity  measurements  and  designed 
specifically  to  deal  with  the  unique  statistics  of  data  from  an  active  imaging  system 
was  developed.  This  advancement  will  enable  imaging  of  exo-atmospheric  objects  over 
a  broad  range  of  engagement  scenarios  that  are  impossible  using  conventional  imaging 
techniques. 


124 


2.  It  was  demonstrated  that  for  image  reconstruction,  PP  data  has  a  more  significant 
impact  on  performance  than  the  conventional  DP  data.  PP  data  provides  a  34%  re¬ 
duction  in  MSE  with  static  aberrations  and  a  60%  reduction  in  MSE  with  dynamic 
aberrations  over  a  conventional  FP-DP  phase  diversity  geometry.  Additionally,  simul¬ 
taneous  utilization  of  FP,  DP,  and  PP  data  planes  produced  results  that  were  only 
marginally  better  than  those  obtained  with  just  a  FP-PP  configuration. 

3.  It  was  demonstrated  that  the  APDI  algorithm  is  robust  under  a  broad  range  of  detec¬ 
tion  SNR,  and  performance  was  characterized  under  a  range  of  scenarios.  The  optimal 
conditioning  bias  and  stopping  criteria  were  found,  and  were  shown  to  be  essentially 
insensitive  to  system  configuration  and  operating  conditions. 

4.  Multiple  frameworks  for  utilizing  polarization  data  to  improve  image  reconstruction 
from  actively  illuminated  targets  were  developed,  and  areas  worthy  of  future  consid¬ 
eration  were  identified. 

8. 3  Future  Work 

Opportunities  for  future  work  can  be  broken  into  two  major  areas:  continued  work  on 
the  APDI  algorithm  and  work  related  to  using  PP  polarization  data  for  imaging.  Primary 
tasks  inviting  future  work  on  the  APDI  algorithm  were  identified  and  include: 

1.  Modify  the  PP-data  model  to  better  account  for  non-square  and  partially  obscured 
pupil  functions. 

2.  Design  and  construct  a  laboratory  experiment  to  test  the  ADPI  algorithm  with  labo¬ 
ratory  data. 

3.  Perform  a  Cramer-Rao  bound  analysis  on  the  estimator. 

The  most  promising  areas  for  future  work  related  to  image  reconstruction  from  PP-polarization 
data  include: 

1.  Numerically  evaluate  the  mean  and  gradients  of  the  polarization  phase  autocorrelation 
on  a  finer  mesh  for  use  in  the  POLE  algorithm  and  compare  with  results  from  the 
APDI  algorithm. 
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2.  Derive  an  analytic  expression  for  the  polarization-phase  autocorrelation  and  its  gra¬ 
dients  for  use  in  the  POLE  algorithm. 

3.  Develop  an  inexpensive  approximation  for  the  polarization- phase  and  its  gradients  for 
use  in  the  POLE  algorithm  and  compare  with  the  APDI  algorithm. 

4.  Incorporate  prior  information  into  the  LAPDI  algorithm  to  better  condition  the  re¬ 
construction. 

5.  Investigate  techniques  for  reducing  the  number  of  parameters  required  to  represent 
the  phase  in  the  LAPDI  algorithm  including  evaluationg  basis-sets  other  than  Zernike 
modes  for  use  in  representing  the  unknown  propagation  phase. 

6.  Simplify  and  implement  the  PP-EM  algorithm. 

8.4  Summary 

As  a  consequence  of  this  work,  A  multi-frame  active  phase  diversity  imaging  (APDI) 
algorithm  was  derived  for  coherent  light  statistics  and  demonstrated.  In  addition  to  con¬ 
ventional  focal-plane  and  diversity-plane  data,  a  statistical  description  for  pupil-plane  (PP) 
intensity  was  formed  and  included  in  the  derivation.  The  algorithm  was  implemented  and 
characterized  via  Monte  Carlo  simulation.  Analysis  showed  that  the  algorithm  is  robust, 
insensitive  to  detection  noise  for  SNR  >  7,  performs  well  for  SNR’s  as  low  as  2,  and  that  the 
effect  of  system  configuration  on  optimal  parameters  is  minimal.  Furthermore,  introduction 
of  PP  data  resulted  in  a  60%  better  reconstruction  from  dynamically  aberrated  data  than 
obtained  using  only  focal-plane  and  diversity-plane  data.  Both  an  EM-algorithm  and  a 
lensless-APDI  approach  were  presented  for  generating  imagery  directly  from  PP  polariza¬ 
tion  measurements.  However,  both  approaches  are  currently  impractical.  Suggestions  for 
improvement  were  offered.  Finally,  the  APDI  algorithm  was  modified  to  use  PP  polarization 
data  in  place  of  PP  intensities.  An  initial  statistical  model  was  offered,  and  suggestions  for 
performance  improvement  were  presented. 
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Appendix  A.  APDI  Code 


This  appendix  contains  the  code  sections  used  to  build  the  APDI  algorithm.  The 
algorithm  was  built  using  Gnu  Make  and  gcc  3.4.6.  Both  the  FFTW3  and  gsl  software 
packages  are  required  to  build  the  executable.  If  the  phase  regularization  term  is  disabled, 
gsl  can  be  removed  from  the  build.  The  routines. f  file  containing  the  L-BFGS-B  code 
must  be  placed  in  the  same  directory  as  objective.solver . f . 

Listing  A.l:  Makefile 


#  set  up  directory  variables 
#HOME=~  p  j  ohnson 
CROOT  = . 

FR00T=$ (CROOT) /solver 
MLIB=$ (CROOT) /lib 


#  set  up  the  relative  paths 
psd_reg=$ (CROOT) /psd_reg 
solver=$(FR00T) 

objctive=$( CROOT )/regd_objective 
myhdrs=$ (CROOT) /lib 
core_reg=$( CROOT ) /core.objective 
phase_reg=$ (CROOT) /phs_reg 

#  set  up  compilers  and  flags . . . 

CC=gcc 

FC=g77 

CFLAGS=-03  -march=prescott  -Wall  -std=c99 
FFLAGS=-03  -march=prescott  -Wall 
#CFLAGS=-g 
&FFLAGS  =  - g 

LIBS =-L$ (HOME) /lib/  -L/usr/ local /lib 

INC  =  - If  f  tw3  -lm  — 1 g  s 1  -lgslcblas 

FINC=-lf rtbegin  -lg2c  -lm  -lgcc_s  -lgcc  - lc 

SEARCH=- 1$ (myhdr s )  -I$(psd_reg)  -1$ (solver)  -I$(objctv)  - 1$ ( cor e_r eg ) \ 
-I$(CR00T)  -1$ (phase_reg) 

#  set  up  other  things  that  we’ll  use... 
mylib=$(MLIB)/mylibs 
psdreg=$(psd_reg)/psd_reg 
objctv=$(objctive)/regd_objective 
slvr=$( solver) / objective_solver 
main=$( solver)/ f reest and ing_ solver 
routines=$(solver)/routines 
core=$(core_reg)/ core_objective 
zerns=$(MLIB)/zernikes 
phs_reg=$(phase_reg) /phs_reg 

#  define  the  objects  needed  to  build  APDIsolve 

OB JS=$ (mylib ) . o  $(psdreg).o  $(objctv).o  $(slvr).o  $(main).o  $ ( r out ine s ) . o  \ 
$(core).o  $(zerns).o  $(phs_reg).o 

#  now  set  up  what  we  need  to  build  the  program 
APDIsolve:  $(0BJS) 

$(CC)  $ ( OB JS )  $ ( CFLAGS )  $(INC)  $(FINC)  $(SEARCH)  $(LIBS)  -o  APDIsolve 

$(phs_reg) .o:$(phs_reg) .c  $(phs_reg) .h 

$(CC)  -c  $(phs_reg).c  $(CFLAGS)  $(SEARCH)  -o  $(phs_reg).o 

$ ( psdr eg ) . o : $ ( psdreg ) . c  $(psdreg).h  $(mylib).o 

$(CC)  -c  $(psdreg).c  $(CFLAGS)  $(SEARCH)  -o  $(psdreg).o 
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$ ( thr eg ) . o : $ ( thr eg ) . c  $(mylib).o 

$(CC)  -c  $ ( thr eg ) . c  $(CFLAGS)  $(SEARCH)  -o  $(threg).o 

$(slvr) . o : $ ( slvr ) .f  {(routines) .o 

$ ( FC )  -c  $ ( slvr ) . f  $ ( FFLAGS )  $(SEARCH)  -o  $(slvr).o 

{(routines) .o:{(routines) .f 

$(FC)  -c  $ ( r out ine s ) . f  $(FFLAGS)  $(SEARCH)  -o  $ ( r out ine s ) . o 

$(objctv) .o:$(objctv) .c  $(objctv) .h 

$(CC)  -c  $(objctv).c  $(CFLAGS)  $(SEARCH)  -o  $(objctv).o 

{(main) .o:$(main) .c 

$(CC)  -c  {(main) . c  $(CFLAGS)  $(SEARCH)  -o  {(main) .o 

$( core ). o :$( core ). c  {(core) .1 

$(CC)  -c  {(core) .c  $(CFLAGS)  $(SEARCH)  -o  {(core).o 

$ ( myl ib ) ,o:$(mylib) .c 

$(CC)  -c  $ ( myl ib ) . c  $(CFLAGS)  $(SEARCH)  -o  $(mylib).o 
((zerns) ,o:{(zerns) .c 

$(CC)  -c  {(zerns) , c  $(CFLAGS)  $(SEARCH)  -o  ((zerns) .o 


.PHONY:  clean 
clean : 

rm  $(0BJS)  APDIsolve 


Listing  A. 2:  config.h 

/*  Peter  Johnson 

*  6  Sep  2006 

* 

*  config . h 

* 

*  this  file  is  for  configuration  options  that  affect  all  of  the  modules  of 

*  the  APDI  algorithm... 

*/ 

//  decide  if  this  is  the  top-level  or  not  for  global  variables... 

# if  def  LOCAL 
double  bias=0.5; 
double  ^inverse ; 
int  inverse.exists =0 ; 

#else 

extern  double  bias ; 
extern  double  ^inverse; 
extern  int  inverse_exist s ; 

#endif 


Listing  A. 3:  f  reestanding_solver .  c 

/*==================================================================== 

*  Peter  Johnson 

*  16  Feb  2006 

* 

*  this  program  is  for  implementing  the  APDI  imaging  algorithm 

*  without  matlab .  The  motiviation  for  this  is  for  debugging  and 

*  speed.  Output  from  the  L-BFGS-B  algorithm  cannot  be  fed  into 

*  matlab ,  and  seg-faults  and  the  like  are  difficult  to  debug  when 

*  running  under  matlab.  Instead ,  the  simulated  data  are  pre-computed 

*  with  matlab,  and  loaded  via  data  files  here. 

$  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  ^:  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =>k/ 

#def ine  LOCAL 
#include  "config.h" 
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# include 
# include 
# include 


<  stdio . h> 
<stdlib .h> 

<  st r ing . h> 


/*  include  the  prototype  for  the  fortan  solving  function  */ 
void  apds lv_ ( double  *L,  double  *guess ,  double  *G,  double  *data, 

int  *frms ,  double  *div,  int  *dim ,  int  *k ,  double  *DrO, 
double  *tol  ); 


// 

prototype 

for 

the  help  print  function 

voi 

d  prnthelp 

() ; 

int 

main ( int 

argc 

,  char  *  argv  []  ) 

{ 


FILE  * 

out  _f 1  =  NULL ,  * dat a_fl  =  NULL ; 

// 

file  pointers 

FILE  * 

guess.f 1=NULL ; 

// 

file  pointers 

s ize_t 

dbl=sizeof (double) ; 

// 

sizes 

s ize_t 

cntr  ; 

// 

count er 

int  m  , 

k  ,  f ,  mm ; 

// 

objective  parameters 

int  f  1 

.error  ; 

// 

file  error  flag 

int  cO 

; 

// 

loop  counter 

int  runk=0; 

// 

ah  err  at  ions  to  correct 

extern 

double  bias ; 

// 

conditioning  bias 

double 

tol  ; 

// 

tolerance  for  convergence 

double 

L  ,  div  ,  DrO  ,  DrOd  ; 

// 

objective  variables 

double 

double 

♦guess ,  *G,  *data; 
tmpl  ; 

// 

ptr  to  data,  guess,  grad 

char  outf 1  [256]  ,  dtaf 1  [256],  gssfl  [256]; 

// 

fil enames 

//  set  the  default  command -line  parameters 
bias  =  0.5; 
tol  =  2e -7 ; 

strcpy (out fl  ,  "results . dat ")  ; 
strcpy (dtaf 1 , " data . dat " ) ; 
strcpy (gssfl , "guess . dat") ; 
runk  =  15 ; 

DrO  =  0; 

////////////////////////////////////////////////////////////////////////// 

//  parse  the  command -line  arguments 

////////////////////////////////////////////////////////////////////////// 

if  (  argc  >1  ) 

{ 

for  (c0=l;  c0<argc;  c0++) 

{ 

//  see  if  we  were  passed  a  bias 
if  ( ! st r cmp ( argv  [ cO]  , " -bias " ) ) 

{ 

bias  =  at  of  ( argv  [  cO  + 1]  )  ;  //  convert  bias  to  a  double 

cO  ++  ; 


ere  passed  a  convergence  tolerance 
np(argv[c0] , "-tol") ) 

tol  =  atof ( argv [cO  +  1] )  ;  //  convert  tolerance  to  a  double 


//  see  i 

/ 

we 

else  if 

(! 

st 

{ 

tol 

= 

at 

cO  ++ 

J 
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//  see  if  we  were  passed  an  output  filename 
else  if  (  !  str cmp  (  argv  [ cO]  ,  " -o  "  )  ) 

{ 

cntr  =  strlen ( argv [cO  +  1] ) ; 
if  ( cntr  <  =  256) 

str cpy ( outf 1 ,  argv[cO+l]); 
cO  ++ ; 

} 

//  see  if  we  were  passed  an  initial  guess  file  name 
else  if  (! strcmp ( argv [cO] -guess ") ) 

{ 

cntr  =  strlen ( argv [cO+1] ) ; 
if  ( cntr  <  =  256) 

strcpy (gssf 1 ,  argv[cO+l]); 
cO  ++ ; 

} 

//  see  if  we  were  passed  a  datafile  name 
else  if  (! strcmp (argv [cO] -data" ) ) 

{ 

cntr  =  strlen ( argv [cO  +  1] ) ; 
if  (cntr<256) 

str cpy ( dt af 1 ,  argv[cO+l]); 
cO++ ; 

} 

//  see  if  we  were  passed  a  number  of  modes  to  correct 
else  if  (! strcmp (argv [cO] , "-modes")) 

{ 

runk  =  atoi  ( argv  [cO  +  1]  )  ;  //  convert  modes  to  int 

cO  ++ ; 

} 

//  see  if  we  were  passed  a  DrO  to  run  for  aberr  regulation 
else  if  (!  str cmp ( argv  [ cO]  -DrO ") ) 

{ 

DrO  =  atof ( argv [cO+1] ) ;  //  convert  DrO  to  a  double 

cO++ ; 

} 

//  print  help  message  if  asked  for 

else  if  (!  str cmp ( argv  [ cO]  -help " )  I  I  ! strcmp ( argv  [cO]  -h" ) ) 

{ 

prnthelp ( ) ; 
return  0; 

} 

//  print  help  message  if  weJre  confused 
else 
{ 

pr intf ( " \nlnvalid  Command  Line  Opt ion\n\n" ) ; 
prnthelp ( ) ; 
return  0; 

} 

> 

pr int f ( " \n " )  ; 

printf("Bias  =  °/0g\n  "  ,  bias  )  ; 
printf  (  "Tolerance  =  °/0g\n"  ,tol); 
pr  int  f  ("  Output  File  =  °/0s \n  "  ,  out  f  1 )  ; 
pr  int  f  ("  Guess  File  =  °/«s\n"  ,  gssf  1 )  ; 
printf  ("Data  File  =  °/,s\n"  ,  dtaf  1)  ; 


> 

/////////////////////////////////////////////////////////////////////////// 
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//  load  the  data 

/////////////////////////////////////////////////////////////////////////// 


//open  the  data  file 
data_fl  =  f open ( dtaf 1 , " r " ) ; 

if  (data_f 1  ==  NULL) 

{ 

pr int f ( " \nError :  Could  not  open  the  data  file  for  reading\n") ; 
return  1 ; 

> 

//  load  the  data  from  the  file... 


fread(&tmpl,  dbl. 

,  i , 

data_f 1 ) ; 

// 

read 

the  array  dim 

m  =  ( int ) tmpl ; 

// 

cast 

it  to  an  int 

f read (&tmpl ,  dbl  a 

,  i , 

dat a_f 1 )  ; 

// 

get 

number  of  frames 

f  =  ( int ) tmpl ; 

// 

cast 

i  t 

f read (& tmpl  ,  dbl. 

,  i , 

data_f 1 )  ; 

// 

read 

#  of  ab err  at  ions 

k  =  ( int ) tmpl ; 

// 

cast 

i  t 

fread(&DrOd,  dbl  a 

,  i . 

dat a_f 1 )  ; 

// 

get 

D/rO 

fread(&div,  dbl. 

i , 

data_f 1 )  ; 

// 

read 

the  diversity  factor 

mm  =  m*m ; 

// 

compute  number  of  pixels 

data  =  malloc  (  (2*f  +  1)  *mm*dbl)  ;  //  allocate  data  array 

cntr  =  fread(data,  dbl ,  (2*f +1) *mm ,  data_fl);  //  read  in  data  frames 

if (  cntr  !=  (2*f +1) *mm) 

{ 

printf ( "\nError :  Couldn’t  read  right  number  of  pixels  from  file"); 
pr  int  f  ( " \n\t°/«d  elements  requested,  °/«d  elements  read\n"  ,  (2*f+l)*mm, 

cntr)  ; 
return  1 ; 

> 

//  close  the  data  file 
f l.error  =  f close ( data_fl ) ; 
if  (fl_error) 

{ 

printf ( "\nError  closing  data  file\n"); 
return  1 ; 

> 

////////////////////////////////////////////////////////////////////////// 
//  load  or  generate  the  initial  guess  data 
////////////////////////////////////////////////////////////////////////// 

//  try  to  open  the  guess  file 
guess_fl  =  f open ( gssf 1 , " r " ) ; 

//  if  we  can’t  open  the  guess  file,  generate  a  starting  guess 
if  (guess_fl  ==  NULL) 

{ 

pr int f ( " \nCouldn ’ t  open  guess  file,  generating  guess  internally \n" ) ; 
guess  =  calloc ( (mm+f *k) , dbl ) ;  //  allocate  memory  for  the  guess 

//  set  oh j  pixels  to  a  constant  unit-average 
for  (c0=0;  cO<mm;  c0  +  +  ) 

*(guess+cO)  =  1.0;  //  constant  initial  oh j  guess 

//  set  the  number  of  modes  if  called  for 
if  (runk>0) 
k  =  runk ; 

//  allocate  memory  doe  the  guess 
guess  =  calloc ( (mm+f *k) , dbl ) ; 

//  set  the  aberration  guess  to  zero 
for  (cO=mm;  cO<(mm+f*k);  c0++) 
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* ( guess  +  cO )  =  0 ; 

> 


//  zero  aberration  initial  guess 


//  otherwise  open  the  data  file  and  read  in  the  guess 
else 
{ 

pr  int  f  (  " \nReading  guess  data  from  °/«s\n"  ,  gssf  1 )  ; 


//  read  and  verify  m 
f read (&tmpl ,  dbl,  1,  guess_fl); 
if  ((int)tmpl  !=  m) 

{ 

printf ("\nError :  Object  dimensions  don’t  match  data  dimensions"); 
return  1; 

> 


//  read  and  verify  f 
fread(&tmpl,  dbl,  1,  guess_fl); 
if((int)tmpl  !=  f) 

{ 

printf (" \nError :  Different  number  of  frames  from  data  file"); 
return  1; 

> 


//  read  k  don’t  need  to  verify...  will  work  with  whatever  is  provided 
fread(&tmpl,  dbl,  1,  guess_fl); 


//  read  and  verify  DrO 
f read (&tmpl ,  dbl,  1,  guess_fl); 
if(tmpl  !=  0  &&  tmpl  !=  DrOd) 

{ 

printf (" \nError :  D/rO  is  different  or  not  zero"); 
pr intf  (  " \n\ tDat a  =>  Zg\nGuess  =>  °/0g"  ,  DrOd  ,  tmpl  )  ; 
return  1; 

> 

if  (Dr0==-1)  //  use  the  data  DrO  to  run  if  called  for 

DrO  =  DrOd; 


//  read  and  verify  div 
fread(&tmpl,  dbl,  1,  guess_fl); 
if ( tmpl  ! =  div ) 

{ 

printf (" \nError  :  Diversity  is  different  from  data  file"); 
printf("\n\tData  =>  Zg\nGuess  =>  °/0g  ",  div  ,  tmpl  )  ; 
return  1; 

> 


//  allocate  memory  for  the  guess  and  read  it  in 

guess  =  malloc ( (mm+f *k) *dbl ) ;  //  allocate  guess  array 

cntr  =  fread(guess,  dbl,  mm+f*k,  guess_fl);  //  read  in  the  guess 

if  (cntr  !=  mm+f*k) 

{ 

printf (" \nError :  Could  not  read  from  data  file"); 

printf  (" \n\t%d  elements  requested,  °/0d  elements  read ",  mm  +  f  *k  ,  cntr); 

return  1; 


//  close  the  data  file 
fl_error  =  f close ( guess_fl ) ; 
if  (fl_error) 

{ 

printf (" \nError :  Error  closing  connections  to  guess  file.\n"); 
return  1; 

> 

> 
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////////////////////////////////////////////////////////////////////////// 
//  call  the  solver  routine .. . 

////////////////////////////////////////////////////////////////////////// 

//  put  out  some  info 

pr  intf  (  " \nlmage  is  °/0d  by  °/«d  "  ,  m  ,  m)  ; 

printf  (  " \nProcessing  °/«d  frames",  f); 

printf  ( " \nProcessing  °/0d  aberrations  per  frame", k); 

printf  (" \nUsing  D/rO  =  °/0g",DrO); 

printf  (" \nUsing  °/«g  waves  of  divers  it  y \n  ",  div  )  ; 

//  allocate  the  gradient  array 

G  =  malloc ( (mm+f *k) * dbl ) ; 

//  call  the  solver  routine 

apdslv_(&L,  guess,  G,  data,  &f ,  &div ,  &m ,  &k ,  &DrO ,  &tol); 

////////////////////////////////////////////////////////////////////////// 
//  write  the  results  to  the  output  file 
////////////////////////////////////////////////////////////////////////// 

out_fl  =  f open ( outf 1 , " w" ) ; 
if (out_f 1==NULL) 

{ 

printf ( "\nError :  Could  not  open  output  file  for  writing\n"); 
return  1 ; 

> 

else 

printf (" \nWriting  results  to  output  file\n"); 
tmpl  =  m; 

cntr  =  fwrite (&tmpl ,  dbl,  1,  out_fl); 
if ( cntr  ! =  1 ) 

{ 

printf ( "\nError :  Could  not  write  to  output  file\n"); 
return  1 ; 

> 

else  printf  ("\nm  =  #/0d",m); 
tmpl  =  f ; 

cntr  =  fwrite (&tmpl ,  dbl,  1,  out_fl); 
if  (  cntr  !  =  1 ) 

{ 

printf ( "\nError :  Could  not  write  to  output  file\n"); 
return  1 ; 

> 

else  printf  ("\nf  =  °/,d",f); 
tmpl  =  k; 

cntr  =  fwrite (&tmpl ,  dbl,  1,  out_fl); 
if ( cntr  ! =  1 ) 

{ 

printf ( "\nError :  Could  not  write  to  output  file\n"); 
return  1 ; 

> 

else  printf  ("\nk  =  °/,d",k); 

cntr  =  fwrite (&DrOd ,  dbl,  1,  out_fl); 
if ( cntr  ! =  1 ) 

{ 

printf ( "\nError :  Could  not  write  to  output  file\n"); 
return  1 ; 

> 

else  printf  (" \nD/rO  =  #/,g",DrOd); 

cntr  =  fwrite (&div,  dbl,  1,  out_fl); 
if ( cntr  ! =  1 ) 
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t 

pr int f ( " \nError :  Could  not  write  to  output  file\n"); 
return  1 ; 

> 

else  printf  (  " \nDiversity  =  #/«g",div); 

cntr  =  f writ e ( guess ,  dbl ,  mm+f*k,  out_fl); 
if (cntr  !=  mm+f*k) 

{ 

printf ( "\nError :  Could  not  write  to  output  file\n"); 
return  1 ; 

> 

else  pr  intf  (  " \nWrot  e  °/0d  object  and  aberration  mm\n"  ,  cntr  )  ; 

//  close  the  output  file 
f l.error  =  f close ( out_fl ) ; 
if ( f l_error ) 

{ 

printf ( "\nError :  Error  closing  connections  to  data  files. \n"); 
return  1 ; 

> 

//  free  up  working  arrays 
free (G)  ; 
free ( data) ; 
free ( guess )  ; 

return  0; 


void  prnthelpO 

{ 

printf ("\n  input  options :\n") ; 

printf (" \t-bias  [double]  ==  bias  offset  for  regular izaiton\n" ) ; 

pr intf ( " \t - tol  [double]  ==  convergence  t olerance \n " ) ; 

printf ("\t-o  [filename]  filename  for  the  resulting  output  data\n")  ; 

printf (" \t-guess  [filename]  filename  with  initial  guess  data  in  it\n"); 

pr intf (" \t - data  [filename]  filename  with  measured  data  in  it\n"); 

pr intf (" \t -modes  [int]  ==  number  of  Zernike  modes  to  correct.  This\n"); 

printf ("\t  can’t  be  used  with  an  external  starting  guess\n") ; 

printf ("\t  If  a  guess  is  supplied,  it’s  mode  number  wins .\n")  ; 

pr intf ( " \t -DrO  [double]  ==  DrO  for  running  phase  regularization  term.\n") ; 

printf  ("\t  set  to  -1  to  use  DrO  from  the  data"); 

printf ("\t  set  to  0  to  disable  regularization  (default)"); 

pr intf (" \t -help  ==  print  this  me s sage \n " ) ; 

printf ("\t-h  ==  print  this  message\n") ; 


> 

Listing  A. 4:  objective.solver  .f 

C  Peter  Johnson 

C  15  Feb  2006 

C 

C  This  section  of  fortran  code  is  intended  to  call  the  objective 

C  function  and  the  numerical  solver  routine.  The  objective  function 

C  is  written  in  C,  but  should  be  callable  from  fortran  as-is.  The 

C  numerical  solver  routine  is  the  LBGFS-B  code  available  from 

C  argonne  national  labs ,  and  is  used  as-is.  This  file  is  based  on 

C  the  driverl.f  file  that  is  provided  with  the  numerical  solver 

C  package. 

C  This  currently  can  handle  a  128x128  pixel  array  with  up  to  a  total 

of  128~2  aberration  parameter es  evenly  distributed  across  all  the 
frames.  To  expand  that ,  increase  nmax  below. 
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subroutine  apdslv(f,  x,  g,  dta ,  frms ,  div ,  dim,  k,  DrO ,  mytol) 

19 

integer  frms,  dim,  k; 

double  precision  f,  x ( dim* dim  +  f rms *k) ,  g ( dim* dim  + frms *k) , 

+  dta  (  (2* f rms +  1) *dim*dim)  ,  div,  DrO,  mytol 

C 

set  up  some  solver  parameters 

24 

integer  pixels 

integer  nmax ,  mmax 

parameter  (nmax=32786,  mmax=15) 

C 

mmax  is  the  dimension  of  the  largest  problem  to  be  solved 

C 

29 

C 

nmax  is  the  maximum  number  of  limited  memory  corrections 

Declair  variables  needed  by  the  solver... 
character *60  task,  csave 

34 

logical  lsave (4) 
integer  n,  m,  iprint  , 

+  nbd(nmax),  iwa (3* nmax )  ,  isave  (44) 

double  precision  factr ,  pgtol  , 

+  l(nmax),  u(nmax),  dsave (29) , 

+  wa (2*mmax*nmax+4*nmax+12*mmax*mmax+12*mmax) 

character*10  interl ,  interG 

39 

C 

declair  any  additional  variables... 

integer  cO ,  iter 

double  precision  fold,  myfctr 

44 

iter  =  1 

cO  =  0 

fold  =  0 

inter I = ’ iOOOO . dat  * 

49 

C 

interG=,g0000 . dat  * 

squelch  output... 

C 

c 

iprint  =  -1 

or  output  every  iteration 
iprint  =  1 

54 

C 

C 

specify  the  tolerances  in  the  stopping  criteria 
factr  =  frms*1.0d+10 

C 

59 

p  gt ol  =  1 . 0  d~3 
factr  =  0 
pgtol  =  0 
myfctr  =  mytol 

C 

64 

specify  the  size  of  the  problem  and  the  number  of  corrections 
pixels=dim*dim 
n  =  pixels +f rms *k 
m  =  15 

C 

69  C 

C 

c 

now  set  up  nbd ,  and  the  upper  and  lower  bounds  on  the  variables 
nbd(i)  is  the  type  of  bound  for  this  variable 

nbd  =  0  =>  no  bound  1  =>  lower  bound  2=>  upper  and  lower  bound  3 
=>  upper  bound  only 

c 

c 

74  C 

l(i)  is  the  lower  bound  on  the  i ’ th  variable 
u(i)  is  the  upper  bound  on  teh  i 1 th  variable 

C 

set  the  lower-bound  on  the  pixel  intensities  .. . 

c 

79 

lower-bound  the  image  positive 
do  10  c0=l, pixels 
nbd(cO)  =  1 

1  (  cO )  =  0 
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rjQjo  c*  oqj 


10 


cont inue 


C  set  the  aberration  variables  to  be  unbounded 

do  12  cO=l , f rms *k 

nbd ( cO+pixels )  =  0 
12  continue 

C  Start  the  iteration  by  initializing  task 

task  =  *  START  ’ 


C  - Begin  the  solving  loop  - 

110  continue 

C  call  the  L-BFGS-B  code 

call  setulb(n,m,x,l,u, nbd , f , g , f actr , pgt ol , wa , iwa , task , ipr int  , 

+  csave  ,  lsave  ,  isave  ,  dsave ) 

if  (task(l:2)  . eq .  1 FG J )  then 

if  we  get  to  this  point  it  is  because  the  solver  wants  the 
objective  value  and  the  gradient  at  the  current  guess 

f  =  objctv(g,  x,  dta ,  div ,  frms ,  k,  dim,  DrO) 

go  back  to  the  minimization  routine 
goto  110 
endif 

if  (task(l:5)  . eq .  1 NEW_X ’ )  then 

if  we  make  it  here,  the  solver  has  come  up  with  a  new  guess  and 
wants  to  continue  execution. . . 

kill  the  iterations  if  we  have  evaluated  the  objective  too  many  times. 
if  (isave(34)  . ge .  10000) 

+  task=’ST0P:  exceeded  10000  function  evals ’ 

kill  iterations  if  we  don’t  make  enough  progress...  have  to  do  it 
here  because  machine  precision  is  different  across  platforms  so  b / in 
factr  doesn’t  work 

if  (abs(fold-f)  . le .  abs (f *myf ctr ) ) 

+  task="ST0P:  relative  reduction  too  small  to  continue" 

C  reset  fold  to  the  current  guess  for  the  next  iteration 

fold  =  f 

C  go  back  to  the  top  of  the  loop  and  keep  going 

goto  110 
endif 

C  if  the  task  isn’t  NEW_X  or  ’FG’,  then  we  are  done,  and  we  stop. 

end  subroutine 


Listing  A. 5:  regcLobjective  .h 

/*  Peter  Johnson 

*  27  Feb  2006 

* 

*  This  is  the  header  file  for  the  function  that  is  intended  to  combine  all 

*  of  the  r  egul  ari  zait  on  and  objective  terms  into  one  function  that  can  be 

*  called  from  fortran  for  use  with  the  numerical  solver. 

* 

*  The  measured  psd  data  passed  to  this  function  needs  to  be  averaged  across 

*  all  of  the  frames  before  being  passed  to  this  function. 
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* 

*  The  focal-  and  diversity  plane  data  should  all  be  passed  in  as  a  single 

*  array  with  the  focal -plane  and  diversity  plane  interleaved  in  the 

*  following  order. . . 

* 

*  focall,  divl ,  focal2,  div2 ,  etc... 

* 

*  The  guess-object  pixel  values  and  the  unknown  phase  aberration  parameters 

*  are  contained  in  a  single  array  with  the  phase  aberration  parameters 

*  concatenated  to  the  end  of  the  object  array.  The  gradient  array  is 

*  identically  distributed. 

* 

*  Because  this  needs  to  be  callable  from  fortran ,  all  arguments  are  passed 

*  by  reference,  and  the  function  name  is  mangled  appropriately 
*/ 


#include  " core.ob j ect ive . h" 

#include  "psd_reg.h" 

double  ob j ctv_ ( double  *G,  double  *guess  ,  double  *data,  double  *div, 
int  *f  ,  int  *k ,  int  *m ,  double  *DrO); 

/*  double  ob j ctv_  ( double  *G ,  double  *guess,  double  *data,  double  *div, 

*  int  f ,  int  *k,  int  *m,  double  DrO) 

* 

*  This  function  returns  the  value  of  the  regularized  objective  function 

*  give  the  focal-plane,  diversity -plane ,  and  pupil -plane  data  from  the 

*  active-imaging  problem.  The  input  data  array  should  contain  the 

*  focal-plane  and  diversity  plane  intensity  frames  interleaved  as  described 

*  above,  followed  by  the  final  frame  of  averaged  pupil-plane  speckle  psds . 

*  The  first  2*f*m~2  elements  of  the  data  array  contain  the  focal  and 

*  diversity  terms,  and  the  final  m~2  elements  contain  the  averaged 

*  pupil-plane  speckle  psd. 

* 

*  The  current  guess  for  which  the  objective  function  is  to  be  evaluated  is 

*  contained  in  the  guess  array,  and  consists  of  the  object  pixel  values  in 

*  the  first  m~2  elements ,  and  the  aberration  parameters  in  the  remaining 

*  f *k  e l ements . 

* 

*  The  known  aberration  for  the  diversity  plane  is  described  by  the  value 

*  contained  in  div .  This  is  the  maximum  number  of  wavelengths  of  phase 

*  added  to  the  phase-front  in  the  pupil-plane  (i.e.  the  amount  of  phase 

*  added  in  the  corners  of  the  array)  . 

* 

*  The  computed  gradient  at  the  current  guess  is  evaluated  and  returned  via 

*  the  G  array.  This  has  the  same  structure  as  the  guess  array,  with  the 

*  gradients  wrt  to  pixel  values  returned  in  the  first  m~2  elements ,  and  the 

*  gradients  wrt  the  aberration  parameters  given  in  the  final  k  elements . 

* 

*  The  estimate  for  D/rO  needs  to  be  passed  in  for  the  phase-regularization 

*  to  work.  If  it  is  unknown ,  set  D/r0==0  to  bypass  that  part  of  the 

*  regularization . 

*/ 

Listing  A. 6:  regcLobjective .  c 

/*  Peter  Johnson 

*  27  Feb  2006 

* 

*  Because  this  needs  to  be  callable  from  fortran ,  all  arguments  are  passed  by 

*  reference,  and  the  function  name  is  mangled  appropriately 
*/ 

#include  "mylibs.h" 

#include  " regd_ob j ect ive . h" 

#include  "phs_reg.h" 
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double  ob j ctv_ ( double  *G,  double  *guess  ,  double  *data,  double  *div. 


int  *f  ,  int  *k ,  int 

*m  , 

double  *DrO) 

int  cO  . 

,  abers 

// 

counter 

int  pixels  = 

^  m  ♦  in  ; 

// 

pixels  in 

the  object 

double 

*  aberr 

»  *  obj  ; 

// 

pointers 

to  guess  obj  and  aberr 

double 

*f  ocal 

,  *psd; 

// 

pointers 

to  measured  data 

double 

*G_I  , 

*G_pp ,  *G_phs ,  *G_p; 

;  // 

pointers 

to  gradient  components 

double 

o 

ii 

-j 

// 

objective 

func tion  value 

abers  =  *f  *  *k; 

//  reset  the  gradient  to  zero 
for  (c0=0;  cO<pixels  +  abers;  c0++) 
*  (  G  +  cO )  =  0 ; 


//  break  out  the  different  pieces  of  the  data  and  the  gradients  from 
//  the  passed -in  data  and  guess  arrays 

obj  =  guess;  //  object  in  the  first  mxm  slots 

aberr  =  guess +  pixels  ;  //  aberration  in  the  last  f*k  slots 

G_I  =  G;  //  obj  grad  in  first  mxm  slots 

G_phs  =  G+pixels ;  //  aberr  grad  in  last  f*k  slots 

focal  =  data;  //  focal -plane  data  in  first  2*f*m~2  slots 

psd  =  data+ (2* ( * f ) *pixels ) ;  //  psd  data  in  last  m~2  data  slots 


//  call  the  core  objective  function  to  get  the  initial  values 
L  =  ob j ect ive ( ob j  ,  aberr,  *div ,  focal,  G_I  ,  G_phs  ,  *f  ,  *k,  *m)  ; 

//  add  the  regularization  terms 


//  do  the  intenisty  regularization 
G_pp  =  calloc ( pixels  , s izeof ( double ))  ; 

L  +=  *f  *  psd_reg(obj,  psd,  G_pp ,  *m) ; 
for  (c0=0;  cCKpixels;  c0  +  +  ) 

*(G_I+cO)  +=  *f  *  *(G_pp+cO); 


//  allocate  working  array 
//  compute  the  contribution 
//  add  the  gradient  contrib 


//  do  the  phase  regularization  if  we  know  D/rO 
if  (*DrO  ! =  0) 

{ 

G_p  =  calloc ( abers  ,  s izeof ( double ))  ;  // 

L  +=  covar_reg ( aberr ,  G_p ,  *k,  *f ,  *DrO) ;  // 
for  (c0=0;  c0<abers;  c0++)  // 

*(G_phs+cO)  +  =  *(G_p+cO); 
free (G_p)  ;  // 

> 


allocate  memory 
compute  the  contribution 
add  the  grad  contrib 

free  memory 


//  free  working  arrays 
f  ree ( G_pp )  ; 


//  return  the  objective  function  value 
return  L; 


Listing  A. 7:  core_objective  .h 

/*  Peter  Johnson 

*  25  Jan  2006 

* 

*  core_objective.h 

* 

*  This  file  is  the  header  file  containing  the  function  prototypes  for 

*  calling  the  non - regul ari zed  active  focal -plane  phase  -  diversity  image 

*  reconstruction  objective  function  and  gradient  . 

*/ 
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/*  # include  "mylibs.h"  */ 


double  obj ect ive ( double  *obj  ,  double  *aberr ,  double  div ,  double  *data , 
double  *G_I  ,  double  *G_phs ,  int  f,  int  k,  int  m) ; 

/*  ======================================================================== 

*  double  objective(double  *obj,  double  * aberr,  double  *data,  double 

*  *G ,  int  k ,  int  m) ; 

* 

*  master  function  for  computing  the  focal-plane  PD  objective  function  value 

*  and  associated  gradients.  The  function  returns  the  value  of  the 

*  objective  function  ,  and  stores  the  gradients  wrt  to  each  object  pixel  in 

*  the  array  pointed  to  by  G. 

* 

*  double  * ob j  ==  pointer  to  m  x  m  object  array  (current  guess) 

*  double  *aberr  ==  pointer  to  the  k-element  unknown  aberration 

*  parameter  array 

*  double  div  ==  number  of  waves  of  KNOWN  defocus  on  the  diversity 

*  channel 

*  double  *data  ==  pointer  to  m  x  m  x  2  array  of  avgd  speckle  data 

*  the  first  frame  is  the  avg  of  the  in- focus  images , 

*  and  the  second  is  the  avg  of  the  diversity  images. 

*  double  *G  ==  pointer  to  m  x  m  gradient  array 

*  int  f  ==  number  of  frames  of  data  begin  processed 

*  int  k  ==  number  of  aberration  parameters  ( zernike  modes) 

*  int  m  ==  number  of  pixels  along  each  dimension  of  the  input  array 

*/ 


void  mkscreen ( complex  *screen,  int  *msk  ,  int  m,  double  *aberr ,  int  k) ; 

/ *  ======================================================================== 

*  void  mkscreen ( complex  *  screen,  int  m,  double  *aberr,  int  k) ; 

* 

*  generate  an  m  x  m  phase  screen  using  the  first  k  zernike  modes ,  with  the 

*  zernike  mode  weights  contained  in  the  array  pointed  to  by  aberr.  Piston 

*  is  not  counted  in  the  aberr  array,  so  the  first  element  of  the  array 

*  corresponds  to  the  weight  of  the  2nd  Noll-indexed  zernike  mode  (tilt). 

* 

*  complex  *screen  ==  pointer  to  m*m  complex  array  to  store  result  in  int  m 

*  ==  number  of  pixels  in  each  dimension  of  the  array  double  *aberr  == 

*  pointer  to  Noll-indexed  zernike  mode  weights  int  k  ==  number  of  elements 

*  in  the  weight  array /numb  er  of  modes 
*/ 


void  con j _dotmult iply ( complex  *out  ,  complex  *in,  int  m)  ; 

/*  ======================================================================== 

*  void  conj_dotmultiply(complex  *out,  complex  *in,  int  m) 

* 

*  does  a  dot-multiply  where  the  result  overwrites  the  first  input,  and  the 

*  result  is  the  2*creal(out  *  conj(in)) 

* 

*  complex  *out  ==  pointer  to  the  output/first  input  array 

*  complex  *in  ==  pointer  to  the  second  input  array 

*  int  m  ==  number  of  elements  in  the  arrays. 

*/ 

void  flip ( complex  *in,  complex  *out,  int  m) ; 

/ *  ======================================================================== 

*  void  flip(complex  *in,  complex  *out,  int  m) 

* 

*  flips  the  array  over  fft-style  so  f(x,y)  =>  f(-x,-y)  with  the  origin 

*  still  in  the  right  place... 

*/ 
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Listing  A. 8:  core_objective .  c 

/*  Peter  Johnson 

*  7  Nov  2006 

* 

*  this  file  contains  the  functions  used  for  evaluation  of  the 

*  un-regul ari zed  focal -plane  objective  function  and  gradient . 

* 

*  as  of  8  Nov  2006  this  has  been  verified  completely ,  and  appears  to  work 

*  as  expected . 

* 

*  see  core_ob j ective . h  for  calling  sequence  and  parameter  details. 

*/ 

#include  "config.h" 

#include  "mylibs.h" 

#include  " core_obj ective . h" 

#include  " zernikes . h" 


double 

{ 


objective (double 
double 


*obj  ,  double  *aberrin,  double  div  ,  double  *datain , 
*G_obj  ,  double  *G_phsin ,  int  f ,  int  k,  int  m) 


int  cO  , 

cl ,  c2  ; 

// 

int  mm= 

m*m  | 

// 

int  *msk ; 

// 

s ize_t 

dbl  =  sizeof ( double )  ; 

// 

s ize_t 

cplx  =  sizeof ( complex ) ; 

extern 

double  bias ; 

// 

double 

o 

ii 

-j 

// 

double 

*  divaberr ; 

// 

double 

defocus ; 

// 

double 

dtmp  ,  gtmp  ; 

// 

double 

*  zmode  ; 

// 

double 

*data ,  *  aberr ; 

// 

complex 

*gl ,  *g2 ,  *psf 1 ,  *psf2; 

// 

complex 

*fl,  *  f  2 ; 

// 

complex 

♦HI,  *H2 ,  *hl ,  *h2 ; 

// 

complex 

sell ,  scl2  ; 

// 

complex 

*  otmp  ; 

// 

complex 

*  scratch ; 

// 

counters 

number  of  pixels  in  each  frame 
binary  mask  array 

some  sizes  for  malloc  and  friends 

bias  away  from  zero 

objective  function  value 

storage  for  div  aberr  coefficients 

multiplier  for  z-mode  ^  defocus 

scratch 

zernike  mode  array 
working  pointers 

cmplx  array  ptrs 

conv  kernel  for  grads 

arrays  for  OTF  3  s  &  impuls  resp  . 

psf  scaling  constants 

storage  for  fft(obj) 

complex  scratch  array 


//  allocate  storage  arrays.  All 
//  with  f f tw_mall o c  ()  and  freed 
//  with  malloc  ()  and  freed  with 
gl  =  f f tw_malloc (mm*cplx) ; 
g2  =  f f tw_malloc (mm*cplx) ; 
psfl  =  f f tw_malloc (mm* cplx ) ; 
psf2  =  f f tw_malloc (mm* cplx ) ; 

HI  =  f f tw_malloc (mm*cplx) ; 

H2  =  f f tw_malloc (mm*cplx) ; 
hi  =  f f tw_malloc (mm*cplx) ; 
h2  =  f f tw_malloc (mm*cplx) ; 
fl  =  f f tw_malloc (mm*cplx) ; 
f2  =  f f tw_malloc (mm*cplx) ; 
scratch  =  f f tw_malloc (mm* cplx ) ; 
otmp  =  f f tw_malloc (mm* cplx ) ; 
divaberr  =  malloc (k* dbl ) ; 
zmode  =  malloc (mm* dbl ) ; 
msk  =  malloc (mm* sizeof ( int )) ; 


arrays  that  are  fft’d  need  to  be  allocated 
with  fftw_free(),  others  can  be  allocated 


free  () 

// 

model  array 

// 

div  model 

// 

psf  array 

// 

div  psf  array 

// 

focal  OTF 

// 

div  OTF 

// 

impu Is  e  resp . 

// 

div  impuls  resp. 

// 

conv  kernel  for 

obj 

// 

conv  kernel  for 

G_ob  j 

// 

complex  scratch 

// 

fft  (obj) 

// 

div  aberration 

// 

zernike  mode 

// 

mask  array 

//  initialize  variables  and  do  frame-independent  stuff 

L  =  0 ;  //  initialize  L  to  zero  .. . 

defocus  =  div*M_PI / sqrt (3) ;  //  amount  of  z ^  to  add  to  div  chanel 


140 


70 

75 

80 

85 

90 

95 

100 

105 

110 

115 

120 

125 

130 


//  compute  the  fft  of  the  object 
rl_t o_cmplx_copy ( obj  ,  gl , mm) ; 
f  f  t2  (gl  ,  otmp  ,  m  ,  m)  ; 


:ycle  t 

hrough 

the 

frames 

of  data 

and 

ac 

cumu late 

th 

e 

resu 

Its  .  . 

o 

o 

II 

o 

cO <f  ; 

cO  ++ 

) 

//  get 

point e 

r  to 

this 

frame  ’ s  d 

at  a  , 

d 

iv  , 

and 

abe 

rr 

arr 

ay 

data  = 

dat ain 

+  c 

0*2*  mm 

;  // 

p  0 't 

int 

er 

to  da 

t  a 

fr 

ames 

aberr 

=  aberr 

in  + 

cO *k  ; 

// 

p  0  'i 

i  n  t 

er 

to  fr 

ame 

a 

berr 

at  ion 

///////////////////////////////////////////////////////////////////// 
//  make  the  OTF’s,  asf’s,  psf’s  and  the  blurred  images 
//  The  OTF’s  and  asfs  are  stored  for  use  in  the  gradient 
//  calculation.  The  psf’s  are  stored  for  use  in  the  gradient  wrt 
//  intensity  calculations. 

///////////////////////////////////////////////////////////////////// 


// - 

//  start  with  channel  1 

mks cr een (HI  ,  msk  ,  m,  aberr  ,  k) 

if ft2 (HI , hi ,  m  ,  m)  ; 

cmplx_f f tshif t (hi  ,  m,  m) ; 

for  (cl=0;  cl<mm;  cl++) 

*(psfl+cl)  =  * ( hi  +  c 1 )  *  con j 
sell  =  1/ cmplx_sum (psf 1  ,  mm)  ; 
cmplx_sclr_mult (psfl ,scll ,mm) ; 
cmplx_sclr_mult (HI , csqrt (sell) 
cmplx_sclr_mult (hi , csqrt (sell) 

// - 


//  now  d 

o  channel  2 

rl_copy  ( 

aberr ,  divab 

err  ,  k)  ; 

* ( di vabe 

rr+2)  +=  def 

ocus  ; 

mks  cr een 

(H2  ,  msk  ,  m  , 

divaberr 

if f t2 (H2 

,  h2  ,  m  ,  m)  ; 

cmplx_f  f 

t shift (h2  ,  m 

,  m)  ; 

f  or ( cl =0 

;  cl <mm ;  c 1 + 

+  ) 

* (psf 2 

+  c  1 )  =  *(h2+ 

cl)  *  con 

scl2  =  1 

/ cmplx_ sum  ( p 

sf 2 , mm)  ; 

cmplx_  s  c 

lr_mult (psf 2 

, sc 12 , mm) 

cmplx_sc 

lr_mult (H2  ,  c 

sqrt ( scl2 

cmplx_sc 

lr_mult (h2 , c 

sqrt ( scl2 

// 

make  the  first  OTF 

// 

make  the  first  asf 

// 

recenter 

*(hl  +  cl))  ; 

// 

mod- square 

// 

get  scale  cons t 

// 

scale  to  unit-sum 

mm)  ; 

// 

scale  HI 

mm)  ; 

// 

scale  hi 

// 

copy  aberr  params 

// 

add  defocus 

k)  ; 

// 

make  2nd  OTF 

// 

make  2nd  asf 

// 

recenter 

*(h2+cl) ) ; 

// 

mod -square 

// 

get  scale  cons t 

// 

scale  to  unit-sum 

mm)  ; 

// 

scale  H 2 

mm)  ; 

// 

scale  h2 

///////////////////////////////////////////////////////////////////// 
//  make  the  model  data  by  convolving  obj  with  psf’s 
///////////////////////////////////////////////////////////////////// 


// - 

//  start  with  channel  1 

fft2(psfl,  fl,  m  ,  m) ; 

cmplx_dotmult iply ( f 1 ,  fl,  otmp,  mm); 
ifft2(fl,  gl ,  m,  m) ; 
cmplx_f ft  shift (gl ,  m,  m)  ; 

// - 

//  do  channel  2 

fft2(psf2,  f2 ,  m,  m) ; 

cmplx_dotmult iply ( f 2 ,  f 2 ,  otmp,  mm); 
ifft2(f2,  g2 ,  m,  m) ; 
cmplx_f ft  shift (g2 ,  m,  m)  ; 

///////////////////////////////////////////////////////////////////// 
//  compute  L  and  fl,  and  f2  for  the  gradient 
///////////////////////////////////////////////////////////////////// 
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f  or 

{ 


} 


(cl=0;  cl<mm;  cl++) 


// - 

//  do  channel  1  contribs 
dtmp  =  *(data+cl)  +  bias; 
gtmp  =  creal (* (gl+cl ) )  +  bias; 

L  +=  log(gtmp)  +  dtmp/gtmp; 
*(fl+cl)  =  (1  -  dtmp /gtmp )/ gtmp ; 

// - 

//  do  channel  2  contribs  (same  as 
dtmp  =  * ( dat a+mm+c 1 )  +  bias; 
gtmp  =  cr eal  ( * ( g2 +  c 1 ) ) +bias ; 

L  +=  log(gtmp)  +  dtmp/gtmp; 
*(f2+cl)  =  ( 1 - dtmp / gtmp )/ gtmp ; 


// 

add 

bias 

to  the  data 

// 

model 

+  bi 

as 

// 

compu 

te  L 

contribut ion 

// 

store 

this 

for  gradient 

ove 

) 

// 

add 

bias 

// 

get 

mode  l 

+  bias 

// 

compute  L 

contrib 

// 

get 

fZ  for 

gradient 

///////////////////////////////////////////////////////////////////// 
//  compute  gradient  uirt  I,  use  gl  and  g2  as  complex  scratch  arrays. 
///////////////////////////////////////////////////////////////////// 


// - 

//  start  with  channel  1 


f  f  t2  (f  1  ,  gl  ,  m  ,  m)  ; 

// 

fft  fi 

flipCpsfl,  scratch,  m) ; 

// 

flip  the  psf 

f ft2 ( scratch ,  psfl,  m,  m) ; 

// 

fft  the  psf 

cmplx_dotmult iply (gl ,  psfl,  gl  ,  mm); 

// 

mu  It iply 

ifft2(gl,  scratch,  m,  m) ; 

// 

ifft  the  result 

cmplx_f ft  shift ( scratch ,  m,  m)  ; 
for  (cl=0;  cl<mm;  cl++) 

// 

recenter 

* ( G_ob j +cl )  +=  creal  (*( scratch+cl )) ; 

// 

add  up  contributions 

// - 

//  now  do  channel  2 

f  f  t2  (f  2  ,  g2  ,  m  ,  m)  ; 

// 

fft  fz 

flip(psf2,  scratch,  m) ; 

// 

flip  the  psf 

f ft2 ( scratch ,  psf 2 ,  m,  m) ; 

// 

fft  the  psf 

cmplx_dotmult iply (g2 ,  psf 2  ,  g2  ,  mm); 

// 

mu  It iply 

ifft2(g2,  scratch,  m,  m) ; 

// 

ifft  the  result 

cmplx_f ft  shift ( scratch ,  m,  m)  ; 
for  (cl=0;  cl<mm;  cl++) 

// 

recenter 

* ( G_ob j +cl )  +=  creal  (*( scratch  +  cl )) ; 

// 

add  up  contribut ons 

////////////////////////////////////////////////////////////////////// 

//  calculate  the  gradient  wrt  the  aberration  parameters  now... 

//  use  gl ,  g2 ,  psfl,  and  psf2  for  scratch  arrays 

////////////////////////////////////////////////////////////////////// 


(cl 

=2;  c 1 <k ;  cl++) 

//  cy 

cle 

through 

aberrat ion 

parameters 

//  St 

art 

at  2  to 

skip  tip/ti 

It 

// 

make  the  zernike 

mode 

for 

the  current  derivat 

ive 

zer 

nike _gr id ( zmode  , 

msk  , 

cl+2 

,  m)  ; 

//  add  2  to 

skip  piston 

//- 

// 

do  channel  1  con 

tribs 

rl_cmplx_dotmult iply (psf 1 ,  zmode ,  HI,  mm); 
cmplx_ s c lr _mult (psf 1 ,  I,  mm); 


//  phi_n*Hl 
//  multiply  by  j 


//  do  the  ifft  and 
ifft2(psfl,  gl ,  m , 
cmplx_f ft  shift (gl , 
con j  _ dot mult iply (gl 


mu  It iply 
m)  ; 

m  ,  m)  ; 

,  hi ,  mm) 


by 


the 


impulse  responses 
//  ifft 
//  recenter 
//  2*Re(conj(hl)*. 


.) 
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//  convolve  with  the  object 

fft2(gl,  psfl,  m ,  m) ; 

// 

fft 

cmplx_dotmul t iply ( gl ,  psfl,  otmp ,  mm); 

// 

mu  It iply 

ifft2(gl,  psfl,  m,  m) ; 

// 

if  ft 

cmplx_f f tshif t (psf 1 ,  m,  m) ; 

// 

recenter 

//  add  the  contributions 
for  (c2=0;  c2<mm;  c2++) 

*(G_phsin  +  cO*k  +  cl)  +=  * ( f 1 +  c2 ) * cr eal  ( * ( psf 1 +  c2 ) ) ; 


// - 

//  now  do  channel  2  contribs 

r l_cmplx_dotmult iply (psf 2 ,  zmode ,  H2 ,  mm);  //  phi_n*H2 


cmplx_sclr_mult (psf 2  ,  I  ,  mm)  ; 

// 

multiply  by  j 

//  do  the  ifft  and  multiply  by  the  impulse 

responses 

ifft2(psf2,  g2 ,  m,  m) ; 

// 

ifft 

cmplx_ff tshif t (g2 ,  m,  m) ; 

// 

recenter 

con j _dotmult iply (g2 ,  h2 ,  mm); 

// 

2*Re(conj(h2)*.  . 

.) 

//  convolve  with  the  object 

fft2(g2,  psf 2 ,  m,  m) ; 

// 

fft 

cmplx_dotmul t iply ( g2 ,  psf 2 ,  otmp,  mm); 

// 

mu  It iply 

ifft2(g2,  psf 2 ,  m,  m) ; 

// 

ifft 

cmplx_f f tshif t (psf 2 ,  m,  m) ; 

// 

recenter 

//  add  the  contributions 
for  (c2=0;  c2<mm;  c2++) 

*(G_phsin+cO*k+cl)  +=  * ( f 2 +c2 ) * cr eal ( * ( psf 2 +c2 ) ) ; 


} 

> 

///////////////////////////////////////////////////////////////////////// 

//  free  up  the  allocated  arrays 

///////////////////////////////////////////////////////////////////////// 


tw 

.free 

(gi) ; 

tw 

.free 

(g2)  ; 

tw 

.free 

(psf 1 )  ; 

tw 

.free 

(psf 2) ; 

tw 

.free 

(HI)  ; 

tw 

.free 

(  H2  )  ; 

tw 

.free 

(hi)  ; 

tw 

.free 

(h2)  ; 

tw 

.free 

(fl)  ; 

tw 

.free 

(  f  2  )  ; 

tw 

.free 

( scratch) 

ee 

( di vaberr ) ; 

ee 

( zmode )  ; 

ee 

(msk) 

; 

///////////////////////////////////////////////////////////////////////// 
//  return  L.  The  gradients  are  passed  out  as  arrays 
///////////////////////////////////////////////////////////////////////// 
return  L; 


void  mkscreen ( complex  ^screen,  int  *msk  ,  int  m,  double  *aberr,  int  k) 

{ 

int  cO,  cl,  mm=m*m; 
double  *screentmp; 


screentmp  =  malloc (mm* s izeof ( double )) ; 
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//  make  the  phase  screen 

for  (c0=2;  cO<k;  cO  ++)  //  start  defocus  to  avoid  &  tip/tilt 

{ 


zernike.gr id ( s creentmp  ,  msk ,  cO  +  2,  m)  ; 
rl_sclr_mult ( screentmp  ,  * (aberr  +  cO)  ,  mm); 
if  ( cO  =  =  2 ) 

rl_to_cmplx_copy (screentmp ,  screen,  mm); 
else 

for  (cl=0;  cl<mm;  cl++) 

* ( screen+cl )  +=  * ( screentmp+cl ) ; 

> 


//  make  the  circular  mode 
//  scale  it 

//  preload  1  mode 


//  turn  it  into  a  phasor . . . 
for  (c0=0;  cO<mm;  c0++) 

{ 

if  (*(msk+cO)  ==  1) 

*(screen  +  cO)  =  cexp  ( I  * creal  ( * ( s creen  +  cO ) ) ) ; 
else 

*(screen+cO)  =  0; 

> 

//  free  up  working  memory 
free ( screentmp) ; 


void  con j _dotmult iply ( complex  *out  ,  complex  *in,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(out+cO)  =  2* creal  (* ( out  +  c0)  *  con j  ( * ( in  + cO ) ) )  ; 

> 


void  flip(complex  *in,  complex  *out,  int  m) 

{ 

int  x  ,  y  ,  u  ,  v  ; 

//  this  fctn  is  for  mapping  f(x,y)  =>  f(-x,-y)  with  the  origin  remaining 
//  at  x=0  and  y=0  using  circular  shifts  ( f ft -style ) 
for  (x=0;  x<m;  x++) 

{ 

if  ( x  =  =  0 ) 
u  =  0  ; 
else 

u  =  m  -  x  ; 

for  (y=0;  y<m;  y++) 

{ 

if  (y ==0) 
v  =  0; 
else 

v=m-y ; 

*  (out  +  u  +  m*v)  =  *  ( in  +  x  +  m*y)  ; 

> 

> 

return ; 

> 


Listing  A. 9:  pscLreg.h 

//  # include  "mylibs.h" 

/*  This  file  should  contain  the  functions  needed  to  compute  the 
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*  intensity  regularization  based  on  the  pupil-plane  speckle  data  and 

*  the  current  object  autocorrelation  data. 

*/ 

double  psd_r eg ( double  *obj ,  double  *datain ,  double  *G,  int  m) ; 

/*  ================================================================== 

*  double  psd_reg  (double  *obj,  double  *data,  double  *G,  int  m,  int  f); 

* 

*  master  function  for  computing  the  speckle-psd  regularization  term 

*  and  associated  gradients.  The  function  returns  the  value  of  the 

*  objective  function ,  and  stores  the  gradients  wrt  to  each  object 

*  pixel  in  the  array  pointed  to  by  G. 

* 

*  double  *  ob  j  ==  pointer  to  m  x  m  object  array 

*  double  *data  ==  pointer  to  m  x  m  array  of  averaged  p-p  speckle  psd’s 

*  double  *G  ==  pointer  to  m  x  m  gradient  array 

*  int  m  ==  number  of  pixels  along  each  dimension  of  the  input  array 

*  int  f  ==  number  of  frames  used  to  generate  the  data  array 
*/ 


Listing  A. 10:  pscLreg.c 

/ *  This  file  should  contain  the  functions  needed  to  compute  the 

*  intensity  regularization  based  on  the  pupil-plane  speckle  data  and 

*  the  current  object  autocorrelation  data.  See  psd_reg.h  for  calling 

*  sequence  etc  .  .  . 

* 

*  this  has  been  verified  agains  finite  differences  for  the  gradient  ,  and  the 

*  objective  is  correct  as  well.  Need  the  input  data  to  be  unit -mean 
*/ 

#include  "mylibs.h" 

#include  "config.h" 


double  psd_r eg ( double  *obj  , 

{ 

extern  double  bias ; 
int  cO  ; 
int  mm=m*m; 
double  * ocor  ; 
double  L; 

double  gtmp ,  dtmp ; 
complex  *ftmp,  *otmp; 
complex  *tmpl ,  *tmp2  ,  *tmp3; 
double  *data; 


double  *datain ,  double  *G,  int  m) 

//  bias  away  from  I/O 
//  counters 

//  pixels  in  each  frame 
//  working  arrays 

//  return  objective  function  value 
//  scratch  double  variables 
//  complex  working  arrays 
//  more  complex  working  arrays 
//  working  array  for  data.  .  . 


//  allocate  temporary  arrays 
ocor  =  malloc (mm* sizeof ( double ))  ; 
tmpl  =  fftw_malloc (mm* sizeof ( complex )) ; 
tmp2  =  fftw_malloc (mm* sizeof ( complex )) ; 
tmp3  =  fftw_malloc (mm* sizeof ( complex )) ; 
ftmp  =  fftw_malloc (mm* sizeof ( complex )) ; 
otmp  =  fftw_malloc (mm* sizeof ( complex )) ; 
data  =  malloc (mm* sizeof ( double )) ; 


/*  compute  the  object  autocorrelation  */ 

rl_aut ocorr ( ob j ,  ocor,  m) ;  //  do  the  autocorrelation 

/*  copy  the  data  to  a  working  array  */ 
rl_copy(datain ,data , mm) ; 

/*  scale  the  correlation  and  data  to  be  a  mean  of  1  */ 
gtmp  =  0; 

for(c0=0;  cCKmm;  c0+  +  ) 

{ 
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*(ocor+cO)  /=  (double)mm; 
gtmp  +=  *(data+cO); 

> 

for(c0=0;  cCKmm;  c0+  +  ) 

*(data+cO)/=( gtmp /(double) mm) ; 


/*  compute  the  objective  function  contribution 
* 

*  The  objective  function  is  derived  by  assuming  spatially 

*  independent  negative  exponential  statistics  on  the  psd,  where  the 

*  parameter  is  determined  by  the  scaled  object  autocorrelation . 

*  The  log -likelihood  of  the  joint  negative  exponential  distribution 

*  is  used  as  the  likelihood  function 

* 

*  also  compute  a  term  (ftmp)  that  will  be  used  for  the  gradient .. . 
*/ 


//  compute  the  objective  and  the  convolution  function  for  the  grad 
L  =  0; 

for  (c0=0;  cO<mm;  c0++) 

{ 

gtmp  =  *  (  ocor  +  cO )  +bias  ;  //  get  the  model  value 

dtmp  =  * (data+cO) +bias ;  //  get  the  data  value 

L  +=  log(gtmp)  +  dtmp/gtmp;  //  compute  contributon 

*(ftmp+cO)  =  (1  -  dtmp/gtmp ) /gtmp ;  //  term  for  use  in  gradient 

> 


//  Now  compute  the  gradients 

//  do  the  required  convolutions / correlations 


rl_t o_cmplx_copy ( obj  ,  tmpl ,  mm);  // 

fft2(tmpl,  otmp  ,  m,  m)  ;  // 

fft2(ftmp,  tmpl,  m,  m) ;  // 

cmplx_dotmul t iply ( tmp2 ,  tmpl,  otmp,  mm);  // 
ifft2(tmp2,  tmp3 ,  m,  m) ;  // 

cmplx_f f tshif t (tmp3 ,  m,  m) ;  // 

for  (c0=0;  cO<mm;  c0++)  // 

*(tmpl+cO)  =  conj (* ( tmpl+cO) ) ; 
cmplx_dotmul t iply ( tmp2 ,  tmpl,  otmp,  mm);  // 
ifft2(tmp2,  tmpl,  m,  m) ;  // 

cmplx_ff tshif t (tmpl ,  m,  m) ;  // 


copy  the  obj  to  cmplx  arry 
ff t  the  object 
fft  ftmp 

multiply  for  conv 
inverse  fft  it 
f  ft  - shi f  t  it 

conjugate  for  correlation 

multiply  for  corr 
inverse  fft  it 
fft - shi f  t  it 


//  sum  the  contributions  and  spit  out  the  result 
for  (c0=0;  cO<mm;  c0++) 

*(G+cO)  =  ( creal  (* ( tmp3  +  c0) )  +  cr eal  (*( tmpl  +  cO )))/( double ) mm ; 


/*  free  up  all  the  working  arrays  */ 

f  ree ( ocor )  ; 

fftw_free(tmpl)  ; 

fftw_free(tmp2)  ; 

fftw_free(tmp3) ; 

f ftw_f ree ( ftmp )  ; 

fftw_free(otmp)  ; 

free ( data)  ; 

/*  return  the  objective  function  contribution  */ 
return  L; 
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Listing  A. 11:  phs_reg.h 

/*  Peter  Johnson 
*  19  Dec  2006 


* 

*  This  file  contains  the  stuff  required  for 

*  image  reconstruction  problem. 

* 

*/ 


# include 
# include 
# include 
# include 
# include 


" conf ig . h " 

"mylibs . h" 

" zernikes . h" 
<gsl/gsl_linalg .h> 
<math . h> 


phase -regularization 


of 


the 


//extern  int  inverse_exists =0 ;  //  logical  for  use  later  set  it  to  zero  here 

//extern  double  *inverse;  //  storage  for  the  inverse  covariance  matrix 

void  calc_covar_inverse (double  *inv,  double  DrO ,  int  k)  ; 

/*  void  cal c_ cov ar_inv ers e ( doubl e  *inv,  double  DrO ,  int  k)  compute  the 

*  inverse  of  the  noil  -  covariance  matrix  for  a  given  number  of  modes ,  k,  and 

*  a  specified  D/rO .  Allocate  the  array  prior  to  calling  this  fctn. 

*/ 


void  make_inv_covar (double  DrO,  int  k) ; 

/ *  allocate  the  inverse  covariance  matrix ,  compute  it,  and  set  the  flag 

*  saying  that  we  allready  computed  it.  The  matrix  and  flag  are  stored  in 

*  global  variables  *  inverse  and  inv ers e_ exi s t s  respectively 
*/ 

void  f ree_inv_covar  ()  ; 

/*  free  the  inverse  covariance  matrix  and  unset  the  flag 
*/ 


double  noll_covar iance ( int  n,  int  m)  ; 

/*  C  =  noil  _  cov  ari  ance  ( int  n,  int  m)  ; 

*  compute  the  zernike-mode  covariance  between  the  n } th  and  m}th  Noll-indexed 

*  zernike  modes  and  assuming  Kolmogorov  turbulence.  This  can  be  used  to 

*  build  the  covariance  matrix,  which  in  turn  can  be  used  to  generate  phase 

*  screens .  To  get  the  true  covariance,  this  needs  to  be  multiplied  by 

*  (D/rO)  ~(5/3) . 

* 

*  inputs : 

*  int  n  ==  noil-index  of  the  first  mode 

*  int  m  ==  noil-index  of  the  second  mode 

* 

*  output : 

*  double  C  ==  covariance  of  the  requested  modes 
*/ 

double  covar.reg ( double  *aberr ,  double  *grad  ,  int  k,  int  f,  double  DrO); 

/*  L  =  covar_reg  ( double  *aberr,  int  k,  int  f,  double  DrO); 

* 

*  compute  the  regularization  term  derived  from  the  expected  intermodal 

*  covariance . 

* 

*  inputs : 

*  double  *aberr  ==  pointer  to  the  aberration  array 

*  int  k  ==  number  of  aberrations  per  frame 

*  int  f  ==  number  of  frames 

* 

*  outputs: 

*  L  ==  regularization  contribution 

*  double  *grad  ==  pointer  to  the  output  gradient  array 
*/ 
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Listing  A. 12:  phsjreg.c 

/*  Peter  Johnson 

*  8  Mar  2006 

* 

*  This  file  contains  the  stuff  required  for  phase-regularization  o 

*  image  reconstruction  problem.  This  should  eventually  encapsulate 

*  inter-modal  and  the  temporal  statistics. 

* 

*/ 

# if ndef  M_PI 

#def ine  M_PI  3.14159265358979323846  /*  pi  */ 

#endif 

#include  <math.h> 

#include  <stdlib.h> 

#include  "phs_reg.h" 

/ /  int  inv  ers  e_  exi  s  t  s  =0 ;  //  logical  for  use  later 

//double  * inverse  //  storage  for  the  inverse  covariance 

void  calc_covar_inverse (double  *inv,  double  DrO ,  int  k) 

{ 

int  cO,cl;  //  counters 

int  s;  //  signum  for  the  LU  decomp 

double  *covar;  //  pointer  to  covariance  matrix 

double  tmp ; 

double  Dr053  =  pow ( DrO  ,  5 . 0/3 . 0 )  ; 

//  allocate  the  covariance  matrix 
covar  =  malloc (k*k* s izeof ( double )) ; 

//  make  gsl_style  matricies  out  of  the  arrays 
gsl.matr ix_view  covarm  =  gsl_matr ix_view_array ( covar , k , k) ; 
gsl_matr ix_view  invm  =  gsl.matr ix_view_array ( inv , k , k) ; 
gsl_permut at  ion  *p  =  gsl_permut at ion_alloc (k)  ; 

//  fill  in  the  covariance  matrix 
for  (c0=0;  c0<k;  c0++) 

{ 

for  (cl=0;  cl<=c0;  cl++) 

{ 

//  get  the  covariance 

//  have  to  index  starting  at  two  to  skip  piston... 
tmp  =  Dr053*noll_covariance (cO+2 , cl+2) ; 

//  matrix  is  symmetric  so  don’t  overdo  it... 

*(covar  +  cO  +  k*cl)  =  tmp; 
if  (  cl !  =  cO ) 

*(covar+cl  +  k*cO)  =  tmp; 

> 

> 

//  use  gsl  to  compute  the  inverse . 

//  start  by  LU  factoring  the  covariance ,  then  compute  the  inverse 
gsl_linalg_LU_decomp (& covarm . matrix ,  p,  &s); 
gsl_linalg_LU_invert (& covarm . matrix ,  p,  & invm . mat rix ) ; 

gsl_permut at ion_f ree (p) ;  //  free  permutaiton  matrix 

free(covar);  //  free  up  covariance  array 

} 

void  make_inv_covar (double  DrO,  int  k) 

{ 

extern  double  *inverse ; 
extern  int  inverse.exist s ; 
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//  allocate  memory 

inverse  =  malloc ( k*k* s izeof ( double ))  ; 
inverse_exists=l; 

//  compute  the  covariance  matrix 
calc_covar_inverse ( inverse ,  DrO ,  k) ; 


return ; 

> 

void  f ree_inv_covar () 

{ 

extern  double  ^inverse; 
extern  int  inverse_exists  ; 
free ( inverse )  ; 
inver se_exists  =  0; 
return ; 

} 

double  noll_covar iance ( int  i,  int  j) 

{ 

double  C,  A,  B,  D;  //  working  doubles 

int  ni ,  mi,  nj ,  mj ;  //  radial  and  angular  orders  for  the  modes 

int  tmp  ,  one  ; 

//  compute  the  radial  and  angular  mode  numbers 
zernikemode  ( i  ,  &ni  ,  &mi); 
zernikemode  ( j  ,  &nj  ,  &mj )  ; 

//  see  if  we  even  need  to  bother 
if  (  (  (  i  -  j  )  #/02  =  =  O)  &&  (  mi  =  =  m  j  )  ) 

//  get  the  exponent  on  the  negative  1  and  decide  if  -l~m  is  negative 
tmp  =  (ni  +  nj  -  2*mi)/2; 
if  (tmp°/02  ==  0) 
one  =  1 ; 
else 

one  =  -1; 

A  =  0 . 0072* one  * sqrt (( ni  +  1 )*( nj +1 )) *pow (M_PI  ,( double ) 8/3)  ; 

B  =  tgamma (( double ) 14/3) * tgamma (( ni +nj -( double ) 5/3) /2) ; 

D  =  tgamma ( (ni -nj +( double ) 17/3) /2) * tgamma (( nj -ni +( double ) 17/3) /2)  * 
tgamma ((ni+nj+(double)23/3)/2) ; 

C  =  A*B/D ; 

> 

else 

C  =  0; 
return  C; 

} 


double  covar.reg ( double  *aberr ,  double  *grad ,  int  k,  int  f,  double  DrO) 

{ 


//  make  sure  the  grad  vector  is 


int  cO  ,  cl  ,  c2  ;  // 

extern  int  inverse.exist s ;  // 

// 

double  L=0;  // 

double  *frmaber;  // 

double  *frmgrad;  // 

extern  double  *inverse;  // 


//  make  the  inverse  covariance 
//  it  in  the  main  program... 
if  ( ! inverse_exist s ) 


zeros  before  passing  it  to  this  fctn 
counters 

logical  1  if  the  covariance  inverse  has 
already  been  computed 
objective  value 

pointer  to  frame  aberration  params 
pointer  to  frame  aberration  gradient 
pointer  to  covariance  inverse  array 

matrix  if  we  need  it  .  .  .  make  sure  you  free 
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make_inv_covar(DrO ,k) ; 


//  loop  over  the  number  of  frames 
for  (c2=0;  c2<f;  c2++) 

{ 

//  get  pointers 

frmaber  =  aberr  +  c2*k;  //  point  to  this  frame’s  aberr  vector 

frmgrad  =  grad  +  c2*k;  //  point  to  this  frame’s  aberr  gradient 

//  compute  alpha ~ T* inv *  alpha  for  the  regulation  term 
//  and  - inv *  alpha  for  the  gradient  .  .  . 

for  (c0=2;  cO<k;  c0++)  //  start  at  2  to  skip  tip/tilt 

{ 

for  (cl=2;  cl<k;  cl++)  //  start  at  2  to  skip  tip/tilt 

{ 

//  do  first  vector  multiply 

* ( f rmgrad+cO)  +=  *( inverse + c 1 +k* cO )  *  * ( f rmaber + c 1 ) ; 

> 

L  +=  *( f rmgrad+cO)  *  * ( f rmaber+cO) ;  //  add  objective  contribs 

* ( f rmgr ad+ cO )  *=  2.0;  //  multiply  by  2 

> 

> 

//  return  the  objective  function  value... 
return  L; 


Listing  A.  13:  mylibs.h 

/*  Header  file  containing  all  of  the  function  prototypes  for  the  routines 
that  I  have  built  in  c.  Most  of  these  are  to  implement  MATLAB-like 
rout ines .  */ 

#include  <stdlib.h> 

#include  <math.h> 

#include  <complex.h> 

#include  <fftw3.h> 

#include  <time.h> 

/*  define  some  constants  */ 

# if ndef  M_PI 

#def ine  M_PI  3.14159265358979323846  /*  pi  */ 

#endif 

/*  set  up  function  prototypes  */ 

void  rl_copy ( double  *from  ,  double  *to  ,  int  n)  ; 

/*  rl_ copy  ( from ,  to,  n)  copies  the  array  pointed  to  by  from  into  the  array 
pointed  to  by  to.  Both  arrays  need  to  be  real  and  contain  n  elements  */ 

void  rl_t o_cmplx_copy ( double  *from ,  complex  *to  ,  int  n)  ; 

/■ *  rl _t o _cmplx_copy  ( from ,  to,  n)  copies  the  array  pointed  to  by  from  into 

the  array  pointed  to  by  to.  Both  arrays  need  to  be  of  size  n.  From  needs 
to  be  a  pointer  to  a  double,  and  to  a  pointer  to  a  complex.  */ 

void  cmplx_t o_cmplx_copy ( complex  *from,  complex  *to ,  int  n) ; 

/*  cmplx_to_cmplx_copy ( from ,  to,  n)  copies  the  array  pointed  to  by  from  into 
the  array  pointed  to  by  to.  Both  arrays  need  to  be  of  size  n,  and  both 
of  type  complex.  */ 

void  cmplx_t o_rl_copy ( complex  *from  ,  double  *to  ,  int  n)  ; 

/*  cmplx_ro_rl_copy ( from ,  to,  n)  copies  the  real  part  of  the  array  pointed 
to  by  from  into  the  array  pointed  to  by  to.  Both  arrays  need  to  be  of 
size  n.  From  needs  to  be  a  pointer  to  a  complex,  and  to  a  pointer  to  a 
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double .  */ 


void  cmplx_abs_copy (complex  *from  ,  double  *to  ,  int  m)  ; 

/*  cmp lx_ ab s _ copy  ( co mp l ex  *from,  double  *to ,  int  m) ;  copies  the  absolute 

value  of  an  m-element  array  from  the  complex  array  pointed  to  by  *from  to 
the  double  array  pointed  to  by  *to  */ 

void  abssq ( complex  *array,  int  m) ; 

/*  abssqCarray,  m,  n)  computes  the  magnitude  squared  of  every  element  in  the 
m-element  array  pointed  to  by  array.  The  array  must  be  complex.  */ 

void  rl_sclr_mult (double  *array,  double  c,  int  n) ; 

/*  rl_sclr_mult  ( array ,  c,  n)  multiplies  every  element  of  the  n-element  array 
by  the  scalar  c.  array  must  point  to  an  array  of  type  double  */ 

void  cmplx_sclr_mult (complex  *array,  complex  c,  int  n) ; 

/*  cmp lx_ s clr_mul t  ( array ,  c,  n)  multiplies  every  element  of  the  n-element 
array  by  the  scalar  c.  array  must  point  to  an  array  of  type  complex  */ 

void  rl_f ftshift (double  *array,  int  n,  int  m) ; 

/*  implement  the  f ft  shift  for  2-D  arrays  to  put  the  origin  at  the  center  of 
the  array.  Only  works  for  even- sized  double  arrays.  */ 

void  cmplx_f ft  shift ( complex  *array,  int  n,  int  m)  ; 

/*  implement  the  f ft  shift  for  2-D  arrays  to  put  the  origin  at  the  center  of 
the  array.  Only  works  for  even-sized  complex  arrays.  */ 

void  f ft 2 ( complex  *in,  complex  *out,  int  m,  int  n) ; 

/*  wrapper  for  the  fftw  to  make  2-d  fft’s  easier.  If  coupled  with  the 
ifft2()  function  below ,  this  is  a  scaled  transform  pair ,  with  all  the 
scaling  done  in  the  inverse  transform.  For  optimal  performace,  the  arrays 
should  be  allocated  with  the  f f tw_mall o c  ()  function.  */ 

void  if ft2 ( complex  *in,  complex  *out ,  int  m,  int  n) ; 

/*  wrapper  for  the  fftw  to  make  inverse  2-d  fft’s  easier.  When  paired  with 
fft2()  above,  this  is  a  scaled  transform  pair  with  all  of  the  scaling 
done  in  the  ifft.For  optimal  performace,  the  arrays  should  be  allocated 
with  the  f f tw_mal l o c ()  function.  */ 

void  inplace_ff t2 ( complex  *in,  int  m,  int  n) ; 

/*  wrapper  for  in-place  fftw  2-d  transform.  When  paired  with 

inpl ace_ifft2  ()  below,  this  is  a  scaled  transform  pair  with  all  of  the 
scaling  done  in  the  if ft.  For  optimal  performace,  the  arrays  should  be 
allocated  with  the  f f tw_mall o c  ()  function.  */ 

void  inplace_iff t2 ( complex  *in,  int  m,  int  n) ; 

/*  wrapper  for  in-place  fftw  inverse  2-d  transform.  When  paired  with 

inolace_f ft2 ()  above,  this  is  a  scaled  transform  pair  with  all  of  the 
scaling  done  in  the  if ft.  For  optimal  performace,  the  arrays  should  be 
allocated  with  the  f f tw_mal l o c  ()  function.  */ 

void  dotmult iply ( double  *out  ,  double  *inl,  double  *in2,  int  m) ; 

/*  void  dotmult iply  ( doubl e  *out,  double  *inl ,  double  *in2,  int  m) 

* 

*  out  ==  pointer  to  the  output  array 

*  ini  ==  pointer  to  the  first  input  array 

*  in2  ==  pointer  to  the  second  input  array 

*  n  ==  integer  number  of  elements  in  the  array 

* 

*  implements  the  equivalent  of  the  .*  operator  in  matlab  such  that 

*  out(i)  =  ini  ( i ) . * in2 ( i ) 

*/ 

void  cmplx_dotmult iply ( complex  *out,  complex  *inl,  complex  *in2,  int  m) ; 

/*  void  dotmultiply ( double  *out,  double  *inl ,  double  *in2,  int  m) 

* 
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*  out  ==  pointer  to  the  output  array 

*  ini  ==  pointer  to  the  first  input  array 

*  in2  ==  pointer  to  the  second  input  array 

*  n  ==  integer  number  of  elements  in  the  array 

* 

*  implements  the  equivalent  of  the  .*  operator  in  matlab  such  that 

*  out(i)  =  ini  (i)  .  *  in2  (i) 

*/ 

void  rl_cmplx_dotmult iply ( complex  *out ,  double  *inl,  complex  *in2,  int 
m)  ; 

/*  void  dotmultiply ( complex  *out ,  double  *inl ,  complex  *in2,  int  m) 

* 

*  out  ==  pointer  to  the  output  array 

*  ini  ==  pointer  to  the  first  input  array 

*  in2  ==  pointer  to  the  second  input  array 

*  n  ==  integer  number  of  elements  in  the  array 

* 

*  implements  the  equivalent  of  the  .*  operator  in  matlab  such  that 

*  out(i)  =  ini  ( i ) . * in2 ( i ) 

*/ 

double  factorial (double  n) ; 

/*  uses  recursion  to  compute  the  factorial  of  an  integer.  Will  crap 
out  for  large  integers...  */ 

void  rl.autocorr ( double  *in,  double  *out ,  int  n) ; 

/ *  ================================================================== 

*  void  rl _ aut o corr ( doubl e  *in,  double  *out,  int  n) 

* 

*  This  function  computes  the  autocorrelation  function  of  a 

*  real-valued  n  x  n  array  using  the  fft.  The  function  must  be  square 

*  for  this  to  work  as  advertised. 

* 

*  double  *in  ==  pointer  to  input  array 

*  double  *out  ==  pointer  to  output  array 

*  int  n  ==  number  of  elements  along  each  dimention  of  the  input  array 
*/ 

void  spec_psd ( double  *in,  double  *out ,  int  n) ; 

/*  =================================================================== 

*  void  sp ec_p s d ( doubl e  *array,  double  *rslt  ,  int  n) 

* 

*  This  function  computes  the  non-normalized  delta-removed  speckle 

*  psd.  In  theory  this  psd  is  related  to  the  reflecting  object’s 

*  autocorrelation  through  a  simple  scaling  constant. 

* 

*  double  *in  ==  pointer  to  the  input  array 

*  double  *out  ==  pointer  to  the  output  array 

*  int  n  ==  number  of  pixels  along  each  dimention  of  the  input  array 
*/ 

void  rl_normal ize ( double  *in,  int  m) ; 

/*  =================================================================== 

*  void  rl_normalize ( double  *in,  int  m)  normalizes  a  real-valued  array 

*  of  m  elements  such  that  the  maximum  value  of  any  element  is  1. 

* 

*  double  *in  ==  pointer  to  the  first  array  element 

*  int  m  ==  integer  number  of  elements  in  the  array 
*/ 

double  rl_max ( double  *in,  int  m)  ; 

/ *  =================================================================== 

*  double  rl _max ( doubl e  *in,  int  m)  returns  the  value  of  the  largest 

*  element  of  the  input  array 

* 
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*  double  *in  ==  pointer  to  the  input  array 

*  int  m  ==  integer  number  of  elements  in  the  array 

* 

*  returns  mx  ==  maximum  value  in  the  array 
*/ 

void  cmplx_normalize (complex  *in,  int  m) ; 

/*  ========================================================================= 

*  void  cp lx_normal i z e ( complex  *in,  int  m)  normalizes  a  compl ex -v alued  array 

*  of  m  elements  such  that  the  maximum  absolute  value  of  any  element  is  1. 

* 

*  complex  *in  ==  pointer  to  the  first  array  element 

*  int  m  ==  integer  number  of  elements  in  the  array 
*/ 

double  cmplx_max ( complex  *in,  int  m)  ; 

/ *  =================================================================== 

*  double  cplx_max  ( compl ex  *in,  int  m)  returns  the  largest  absolute  value 

*  of  the  input  array 

* 

*  complex  *in  ==  pointer  to  the  input  array 

*  int  m  ==  integer  number  of  elements  in  the  array 

* 

*  returns  mx  ==  maximum  absolute  value  in  the  array 
*/ 

double  rl_sum ( double  *in,  int  m)  ; 

/ *  ================================================================== 

*  double  rl _sum ( doubl e  *in,  int  m) ; 

* 

*  computes  the  sum  of  the  elements  in  the  array  pointed  to  by  *in 

* 

*  double  *in  ==  pointer  to  the  array 

*  in  m  ==  number  of  elements  in  the  array 

* 

*  returns  S  =  sum  (in); 

*/ 

complex  cmplx_sum ( complex  *in,  int  m)  ; 

/*  ================================================================== 

*  double  cmplx_sum ( double  *in,  int  m) ; 

* 

*  computes  the  sum  of  the  elements  in  the  array  pointed  to  by  *in 

* 

*  double  *in  ==  pointer  to  the  array 

*  in  m  ==  number  of  elements  in  the  array 

* 

*  returns  S  =  sum (in); 

*/ 

double  unit .scale ( double  *in,  int  m) ; 

/*  ================================================================== 

*  double  unit _s cal e  ( doubl e  *in,  int  pixels) 

* 

*  scales  the  input  array  to  have  unit-average  values  and  returns  the 

*  required  scaling  constant. 

* 

*  double  *in  ==  pointer  to  the  array 

*  int  m  ==  number  of  elements  in  the  array 

*  double  (return  value)  ==  multiplicative  scaling  constant  used  to 

*  get  unit-average  intensity . 

*/ 

Listing  A.  14:  mylibs.c 

#include  "mylibs.h" 
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/*  functions  to  copy  the  contents  of  arrays  */ 


void  rl_copy ( double  *from  ,  double  *to  ,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(to  +  cO)  =  *(from  +  cO); 
return ; 

> 

void  rl_t o_cmplx_copy ( double  *from  ,  complex  *to  ,  int  m) 

{ 

int  cO  ; 
complex  tmp ; 

for  (c0=0;  cO<m;  c0+  +  ) 

{ 

tmp  =  *(from+cO); 

* (to+cO)  =  tmp ; 

> 

return ; 

> 

void  cmplx_t o_cmplx_copy ( complex  *from,  complex  *to  ,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(to  +  cO)  =  *(from  +  cO); 
return ; 

> 

void  cmplx_t o_rl_copy ( complex  *from  ,  double  *to  ,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(to  +  cO)  =  creal (*(from  +  cO)); 
return ; 

} 

void  cmplx_abs_copy ( complex  *from  ,  double  *to  ,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0+  +  ) 

*(to  +  cO)  =  cabs  ( *  ( f  rom  +  cO)); 
return ; 

> 

/*  function  to  compute  the  magnitude  squared*/ 
void  abssq ( complex  *array,  int  m) 

{ 

int  cO  ; 

complex  tmp ; 

for  (c0=0;  cO<m;  c0++) 

{ 

tmp  =  *(array  +  cO); 

*(array  +  cO)  =  tmp  *  conj (tmp) ; 

> 

return ; 

> 

/*  scalar  dot  multiply  functions  for  arrays  */ 
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void  rl_sclr_mult (double  *array,  double  c,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(array  +  cO)  =  c  *  (*(array  +  cO)); 
return ; 


void  cmplx_sclr_mult ( complex  *array ,  complex  c,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(array  +  cO)  =  *(array+cO)  *  c; 
return ; 


/*  routine  to  do  the  fft-shift  ( i.e .  put  dc  at  the  center  of  the 
shift)  */ 

void  rl_fftshift (double  *array,  int  m,  int  n) 

{ 

int  cO  ,  cl  ; 
double  *  tmp ; 

tmp  =  malloc (m*n* sizeof ( double )) ; 

int  x  ,  y  ,  xm  ,  ym  ; 
x  =  m/2 ; 
y  =  m/2; 

for  (c0=0;  cO<m;  c0++) 

{ 

for(cl=0;  cl<n;  cl+  +  ) 

{ 

xm  =  cO  -  x;  if  (xm  <  0)  xm  +=  m; 

ym  =  cl  -  y;  if  (ym  <  0)  ym  +=  n; 

*(tmp  +  cO  +  m*cl)  =  *(array  +  xm  +  m*ym) ; 

> 

> 

rl_copy(tmp,  array, m*n)  ; 

f  r ee ( tmp )  ; 
return ; 

> 

void  cmplx_f ft  shift ( complex  *array,  int  n,  int  m) 

{ 

int  cO  ,  cl  ; 
complex  * tmp ; 

tmp  =  malloc (m*n* sizeof ( complex ))  ; 

int  x  ,  y  ,  xm  ,  ym  ; 
x  =  m/2 ; 
y  =  m/2; 

for  (c0=0;  cO<m;  c0++) 

{ 

for(cl=0;  cl<n;  cl++) 

{ 

xm  =  cO  -  x;  if  (xm  <  0)  xm  +=  m; 

ym  =  cl  -  y;  if  (ym  <  0)  ym  +=  n; 

*(tmp  +  cO  +  m*cl)  =  *(array  +  xm  +  m*ym) ; 

> 

> 

cmplx_t o_cmplx_copy (tmp  ,  array , m*n)  ; 
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f ree ( tmp )  ; 
return ; 

> 

void  f ft 2 ( complex  *in,  complex  *out,  int  m,  int  n) 

{ 

/*  wrapper  for  the  fftw  to  make  2-d  fft3s  easier  */ 

/*  for  best  performance  make  sure  that  the  "in"  and  "out"  arrays  are 
allocated  with  the  f ftw_malloc ()  function.  */ 

f f tw_pl an  pf ; 

/*  set  up  the  fft  routine  */ 

pf  =  fftw_plan_dft_2d(m,n, in , out , FFTW_ FORWARD  ,FFTW_ESTIMATE)  ; 

/*  19  Jan  2006...  for  some  reason  the  fft  routine  called  with  FFTW_MEASURE 
gives  inconsistent  results .. .  doesnt3  always  work.  FFTW_ESTIMATE  seems 
to  be  OK*/ 

f f tw.execute (pf ) ;  /*  do  the  transform  */ 

// f  ftw_  c  l  eanup  ()  ; 
f f tw_destroy_plan (pf ) ; 

> 

void  if ft2 ( complex  *in,  complex  *out  ,  int  m,  int  n) 

{ 

/*  wrapper  for  the  fftw  to  make  inverse  2-d  fft  3s  easier  */ 

/*  for  best  performance  make  sure  that  the  "in"  and  "out"  arrays  are 
allocated  with  the  f f tw_mal l o c ()  function.  This  also  implements  the 
scaling  required  to  make  this  a  scaled  transform  pair  when  paired  with 
fft2()  above.  */ 

f f tw_plan  pf ; 

/*  set  up  the  fft  routine  */ 

pf  =  fftw_plan_dft_2d(m,n, in , out , FFTW_ BACKWARD  ,FFTW_ESTIMATE)  ; 

/ *  19  Jan  2006.  .  .  for  some  reason  the  fft  routine  called  with  FFTW_MEASURE 
gives  inconsistent  results .. .  doesnt 3  always  work.  FFTW_ESTIMATE  seems 
to  be  OK*/ 

f f tw.execute (pf ) ;  /*  do  the  transform  */ 

cmplx_sclr_mult  (out  ,  1/ ( double ) (m*n)  ,  m*n)  ;  /*  fix  the  scaling  .  */ 

// f ftw_cl eanup  () ;  /*  cleanup  fftw  */ 

f f tw_destroy_plan (pf ) ; 


void  inplace_ff t2 ( complex  *ary,  int  m,  int  n) 

{ 

/*  inplace  ffts  are  slower  than  the  others .. .  */ 

/*  in-place  2-d  fft ,  see  fft2()  above  */ 
f f tw_plan  pf ; 

pf  =  fftw_plan_dft_2d(m,n, ary , ary , FFTW .FORWARD ,FFTW_ESTIMATE) ; 
fftw_execute (pf ) ; 
f f tw_cleanup () ; 
f f tw_destroy_plan (pf ) ; 


void  inplace_iff t2 ( complex  *ary ,  int  m,  int  n) 

{ 

/*  inplace  ffts  are  slower  than  the  others .. .  */ 

/*  in-place  2-d  inverse  fft  with  normalization ,  see  ifft2()  above  */ 
f f tw_plan  pf ; 
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pf  =  fftw_plan_dft_2d(m,n, ary , ary , FFTW_ BACKWARD  ,FFTW_ESTIMATE)  ; 
fftw_execute(pf) ; 

cmplx_sclr_mult  (ary  ,  1/ (  double  )  (m*n)  ,  m*n)  ;  //  fix  scaling 
f f tw_cleanup ( ) ; 
f f tw_destroy_plan (pf ) ; 


/*  matlab  like  dot  multiply  */ 

void  dotmult iply ( double  *out  ,  double  *inl,  double  *in2,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(out+cO)  =  *(inl+cO)  *  (*(in2+c0)); 

} 

void  cmplx_dotmult iply ( complex  *out,  complex  *inl,  complex  *in2,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0+  +  ) 

*(out+cO)  =  *(inl+cO)  *  (*(in2+c0)); 

} 

void  rl_cmplx_dotmult iply ( complex  *out  ,  double  *inl,  complex  *in2,  int  m) 

{ 

int  cO  ; 

for  (c0=0;  cO<m;  c0++) 

*(out+cO)  =  *(inl+cO)  *  (*(in2+c0)); 

} 

double  rl_sum ( double  *in,  int  m) 

{ 

int  cO  ; 
double  S=0; 

for  (c0=0;  cO<m;  c0+  +  ) 

S  +=  * ( in  +  cO )  ; 
return  S; 

} 

complex  cmplx_sum ( complex  *in,  int  m) 

{ 

int  cO  ; 

complex  S=0; 

for  (c0=0;  cO<m;  c0++) 

S  +=  *(in+cO); 
return  S; 

} 

/*  factorial  function  for  ints  */ 
double  factorial (double  n) 

{ 

if  (  ( int ) n  =  =  0)  return  1; 

else  return  (n*f actorial (n-1) ) ; 

} 

/■ *  scaled  autocorrelation  function  for  real  data  */ 
void  rl_autocorr ( double  *in,  double  *out ,  int  n) 

{ 

complex  *tmpl ,  *tmp2;  //  pointer  to  complex  data  type 

tmpl  =  ff tw_malloc (n*n* sizeof ( complex )) ; 
tmp2  =  ff tw_malloc (n*n* sizeof ( complex )) ; 

rl_to_cmplx_copy ( in ,  tmpl,  n*n) ;  //copy  input  to  working  array 
fft2(tmpl,  tmp2 ,  n,  n) ;  //  do  the  fft 

abssq(tmp2,  n*n) ;  //  get  mag  squared 
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ifft2(tmp2,  tmpl ,  n ,  n) ; 
cmplx_t o_rl_copy ( tmpl ,  out,  n*n) ; 
rl_f ft  shift (out  ,  n,  n)  ; 
fftw_free(tmpl)  ; 
fftw_free(tmp2)  ; 

> 

/*  normalization  function  */ 
void  rl_normal ize ( double  *in,  int  m) 

{ 

int  cO  ; 
double  mx ; 

mx  =  rl_max  (in  ,m)  ;  //  find  the  maximum 
for  (c0=0;  cO<m;  c0++) 

*(in+cO)  =  *  (  in  + cO ) /mx  ;  //  divide  it  out 


double  rl_max ( double  *in,  int  m) 

{ 

int  cO  ; 
double  mx ; 
mx  =  *  in ; 

for  (cO=l;  cO<m;  c0++) 
if  (*(in+cO)  >  mx) 
mx  =  * ( in+  cO )  ; 
return  mx ; 

> 

/*  normalization  function  for  compelx  data  */ 
void  cmplx_normal ize ( complex  *in,  int  m) 

{ 

int  cO  ; 
double  mx ; 

mx  =  cmplx_max ( in  ,  m)  ;  //  find  the  maximum  (magnitude) 

for  (c0=0;  cO<m;  c0++) 

*(in+cO)  =  *( in+ cO )/( complex ) mx ;  //  divide  it  out 

> 

double  cmplx_max ( complex  *in,  int  m) 

{ 

int  cO  ; 

double  mx  ,  tmp=0; 
mx  =  cabs ( *  in )  ; 
for  (cO=l;  cO<m;  c0++) 
tmp  =  cabs ( * ( in+ cO ) ) ; 
if  (tmp  >  mx) 
mx  =  tmp  ; 
return  mx ; 

> 

/*  delta -removed  speckle  psd  calculation  function  */ 
void  spec_psd ( double  *in,  double  *rslt  ,  int  n) 

{ 


//  do  inverse  fft 

//  copy  the  real  part  to  out 

//  fft shift  it 

//  deallocate  working  arrays 


complex  *tmpl  ,  *tmp2;  //  pointer  to  complex  data  type 
complex  delta  =  n*n; 

tmpl  =  ff tw.malloc (n*n* sizeof ( complex )) ;  //  allocate  memory 

tmp2  =  ff tw.malloc (n*n* sizeof ( complex )) ; 

rl_t  o_cmplx_copy  ( in  ,  tmpl,  n*n)  ;  //  copy  to  the  working  array 

fft2(tmpl,  tmp2 ,  n,  n) ;  //  do  the  fft 

*tmp2  =  *tmp2  -  delta;  //  remove  the  delta 

abssq ( tmp2 , n*n) ;  //  get  the  mag  squared 
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cmplx_t o_rl_copy ( tmp2 , rslt ,n*n) ; 
fftw_free(tmpl) ; 
fftw_free(tmp2) ; 
rl_fftshift (rslt ,  n  ,  n) ; 


//  copy  rslt  to  output  array 
//  free  memory  up 


//  f ft  shift  the  result 


return ; 

> 

/*  delta-removed  speckle  psd  calculation  for  a  circular  aperture  */ 
void  circle_spec_psd ( double  *in,  double  *rslt  ,  int  n) 

{ 

complex  * tmpl ; 
complex  *subarray; 
complex  delta; 
int  m  ,  st rt  ; 
int  cO  ,  cl  ; 

m  =  n/sqrt(2);  //  dimension  of  the  largest  inscribed  square 
strt  =  n/2-m/2;  //  where  the  sub-array  starts 

//  allocate  working  arrays 

subarray  =  f f tw.mallo c (m*m* s izeof ( complex )) ; 
tmpl  =  ff tw.malloc (n*n* sizeof ( complex )) ; 

delta  =  m*m;  //  how  big  is  the  delta.  .  . 

for  (c0=0;  cO<m;  c0++)  //  copy  out  the  sub-array  w/ in  pupil 

for(cl=0;  cl<m;  cl++) 

* ( subarray+cO+m*cl )  =  * ( in  +  cO+strt  +  ( c 1 + st rt ) *n) ; 
inplace_f  ft  2  (  subarray  ,  m,  m)  ;  //  take  the  fft 

*subarray  =  *subarray  -  delta;  //  take  out  the  delta  at  zero  freq 

abssq ( subarray  ,  m*m)  ;  //  get  the  absolute -value  squared... 

cmplx_f f t shif t ( subarray  ,  m,  m) ;  //  fft-shift  it 

//  now  interpolate  it  back  to  the  right  size  by  fft-zeropad-ifft 
inplace_f ft 2 ( subarray ,  m,  m) ;  //  go  back  to  fft-land 

cmplx_f ft shif t ( subarray ,  m,  m) ;  //  fft-shift 

for  (c0=0;  cO<n;  cO++)  //  zero-pad 

t 

for(cl=0;  cl<n;  cl+  +  ) 

{ 

if  (cO<strt  I  I  cO>  =  m+strt  I  I  cl<strt  I  I  cl>=m+strt) 

* ( tmpl +cO+n* cl )  =  0; 
else 

* ( tmpl +cO+n* cl )  =  *(subarray  +  (cO-strt)  +  ( cl - strt ) *m) ; 


> 


> 


//  copy  the  result  out  to  the  result  array 
cmplx_abs_copy ( tmpl ,  rslt,  n*n)  ; 

//  free  the  working  arrays 
f ftw.free (sub array) ; 
fftw_free(tmpl) ; 


> 


//  scale  the  input  array  to  have  unit-average  values 
double  unit .scale ( double  *in,  int  pixels) 

{ 

double  sum=0; 
int  cO  ; 

for  (c0=0;  cO<pixels;  cO++) 
sum  +=  * ( in+cO ) ; 
sum  =  ( double ) pixels / sum ; 
for  (c0=0;  cO<pixels;  c0++) 

*(in+cO)  =  * ( in  + cO ) *  sum ; 
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return  sum ; 


> 


Listing  A.  15:  zernikes.h 

double  zernike_val (double  r,  double  theta,  int  P) ; 

/  * 

double  val  =  z ernike_v al ( doub l e  r,  double  theta,  int,  n,  int  m) ; 
evaluates  Z_n~ m(r ,  theta) ,  the  n,m  order  zernike  polynomial  at  the  radial 
point  r,  and  the  angular  point  theta. 

inputs : 

double  r  ==  radial  position .  note:  these  are  only  defined  for 

r<l ,  it  will  return  a  value, 
but  it  is  not  valid. 

double  theta  ==  angular  position .  0<= theta  <2pi 

int  P  ==  Noll-indexed  zernike  mode  number 

returns  : 

val  ==  value  of  the  (n ,  m)  zernike  mode  at  location  (r,  theta) 

*/ 

void  zernikemode  ( int  P,  int  *n ,  int  *m)  ; 

/*  returnes  the  radial  and  frequency  indices  n  and  m  for  a  given 
noil-indexed  zernike  mode  P 

inputs : 

P  ==  int  noil-indexed  mode  number 
outputs  : 

n  ==  int  ptr  to  radial  zernike  index 
m  ==  int  ptr  to  frequency  zernike  index  */ 

void  zernike_gr id ( double  *arry,  int  *mask ,  int  P,  int  m) ; 

/*  evaluate  the  P 3  th  zernike  mode  across  the  m  x  m  grid  assuming  the  array 
extends  from  -1  to  1  in  both  directions,  and  mask  the  result  outside  of 
the  unit  - circ le . 

inputs  : 

array  ==  double  pointer  to  the  allocated  output  array 
mask  ==  int  pointer  to  the  mask  array 
P  ==  int  noli  index  of  the  desired  mode 
m  ==  int  number  of  pixies  across  the  array 

*/ 

void  masked.zscreen (double  *arry  ,  double  *mask ,  int  P,  int  m) ; 

/*  evaluate  the  P 1  th  zernike  mode  across  the  m  x  m  grid  assuming  the  array 
extends  from  -1  to  1  in  both  directions  ,  and  mask  the  result  in 
accordance  with  the  analog  mask  array  *mask.  For  the  zernike  screen  to 
be  valid,  the  mask  must  be  zero  for  all  pixels  that  lie  completely 
outside  the  unit-circle. 

inputs : 

array  ==  double  pointer  to  the  allocated  output  array 
mask  ==  double  pointer  to  the  mask  array 
P  ==  int  noli  index  of  the  desired  mode 
m  ==  int  number  of  pixies  across  the  array 

*/ 

void  zs creen ( double  *arry,  int  P,  int  m) ; 

/*  make  a  zernike  mode  screen  that  is  valid  across  the  entire  grid, 
i.e.  the  unit-circle  completely  encompases  the  grid. 

this  just  like  zernike_grid  with  the  spatial  limits  such  that  -1/sqrt  (2) 
<=  (x,y)  <=  1/sqrt  (2)  so  that  the  entire  grid  fits  inside  the  unit  circle 
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64 

3 

8 

13 

18 

23 

28 

33 

38 

43 

48 

53 


inputs : 

array  ==  double  pointer  to  the  allocated  output  array 
P  ==  int  noli  index  of  the  desired  mode 
m  ==  int  number  of  pixies  across  the  array 

*/ 

Listing  A. 16:  zernikes  .  c 

/*  this  file  is  for  development  and  testing  of  zernike  -mode  generation 
code .. .  */ 

#include  "mylibs.h" 

#include  " zernikes . h" 

/*  see  zernikes. h  for  details  about  calling  these  functions  */ 


/** 


t  */ 


double  zernike_val (double  r,  double  theta,  int  P) 

{ 

int  n  ,  m  ,  s  ; 
int  odd ; 

double  R,  num ,  den,  scl  ; 
double  tmpl ,  tmp2  ; 


/*  use 
switch 
{ 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 

case 


hard-coded  zerikes  for  p<Jh5,  otherwise  compute  with  the  sum  */ 
(P) 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 
23 


1 ;  break ; 

2*r* cos ( theta) ;  break; 
2*r* sin ( theta) ;  break; 
sqrt (3) * (2*r*r-l) ;  break 
sqrt (6) *r*r*sin (2*theta) 
sqrt (6) *r*r*cos (2*theta) 
sqrt  (8)  *  (3* r  *r *r  - 
sqrt  (8)  *  (3* r  *r *r  - 


25 

26 

27 

28 
29 


24:  R 


break ; 
break ; 

2*r )* sin (theta) ; 
2*r )* cos ( theta) ; 


break ; 
break ; 


sqrt (8) *r*r*r*sin(3*theta) ;  break; 
sqrt (8) *r*r*r*cos (3*theta) ;  break; 
sqrt (5) * (6*r*r *r*r  -  6*r*r  +1);  break; 


sqrt ( 10) * 
sqrt (10) *(4* 
sqrt (10) 
sqrt (10) 
sqrt  ( 12) * 
sqrt (12) *(10 
sqrt ( 12) * 
sqrt  (12)  *  (5 
sqrt (12) 
sqrt (12) 
sqrt  (7) * 
sqrt ( 14) * 

sqrt (14) * ( 

sqrt ( 14) * (6s 
sqrt (14)  *  < 
sqrt (14) 
sqrt (14) 


-  3*r *r ) *  cos (2* theta)  ;  break; 

-  3*r *r ) *  sin (2* theta) ;  break; 
(4*theta) ;  break; 

(4*theta) ;  break; 

*cos (theta) ;  break; 
*sin(theta);  break; 

-  4* r *r *r ) *  cos (3* thet a )  ;  break; 

-  4*r*r*r) *sin (3*theta) ;  break; 
(5*theta);  break; 

(5*theta);  break; 

r*r*r*r+i2*r*r-l) ;  break; 
20*r*r*r*r  + 

6*r *r ) *  sin (2* theta) ;  break; 

*r*r*r*r*r*r  -  20*r*r*r*r  + 

6*r *r ) *  cos  (2* theta)  ;  break; 

*  sin (4* thet a)  ;  break; 


30 :  R 


4*r ) *  sin ( thet a)  : 


4*r ) *  cos ( thet a)  : 


l (6*  theta) ; 
i  (6* theta) ; 
60*r*r *r*r*r 
break ; 

60*r*r*r*r*r 
break ; 


i *  cos (4*  theta)  : 
break ; 
break ; 

+  30*r *r *r  - 
+  30*r *r *r  - 


break ; 
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93 
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103 

108 

113 

118 


case  31 :  R 
case  32:  R 


case  33 
case  34 
case  35 
case  36 
case  37 


case 

38  : 

R 

= 

sqrt (18) 

case 

39  : 

R 

= 

sqrt (18) 

case 

40  : 

R 

= 

sqrt (18) 

case 

41  : 

R 

= 

sqrt (18) 

case 

42  : 

R 

= 

sqrt (18) 

case 

43  : 

R 

= 

sqrt (18) 

case 

44: 

R 

= 

sqrt (18) 

case  45: 

default  : 

R 

= 

sqrt (18) 

/* 

get 

the 

zernike  i 

30*r*r*r*r*r  + 

10*r *r *r ) *  sin (3* theta)  ;  break; 

4 * ( 2 1 * r * r * r * r * r * r * r  -  30*r*r*r*r*r  + 

10*r *r *r ) *  cos (3* theta)  ;  break; 

) *  cos (5*  theta)  ; 

J *  theta)  ;  break ; 

;(7*theta);  break; 

90*r*r*r*r  -  20*r*r  +  1);  break; 

60*r*r*r*r  -  10*r *r ) *  cos  (2* theta)  ;  break; 

60*r*r*r*r  -  10*r *r ) *  sin  (2* theta)  ;  break; 

15*r*r*r *r ) *  cos (4* theta)  ;  break; 

*  sin (4* theta)  ;  break; 

*  cos (6* theta) ;  break; 

*  sin  (6* theta) ;  break; 

*  cos  (8* theta)  ;  break; 

)* sin  (8* theta)  ;  break; 


zernikemode (P ,  &n ,  &m) ; 
if  (  P  °/0  2  =  =  0  ) 
odd=0 ; 
else  odd=l ; 


/*  use  the  equations  from  Noll  to  compute  the  zernike  val  */ 
R  =  0; 

for  (s  =  0;  s<=(n-m)/2;  s+  +  ) 

{ 

if  (s°/»2  =  =  0)  //  even  power... 

scl  =  1 ; 

else  //  odd  power 
scl  =  -1 ; 

num  =  scl  *  f actorial (n-s) ; //  *  pow  (r ,  (n- (2* cO) ) ) ; 
den  =  factorial (s)  *  f actor ial ( (double ) (n-m) /2  -  s)  * 
fact orial (( double ) (n+m) /2  -  s); 

R  +  =  (num/den) *pow (r , (n- (2* s) ) ) ; 


if  (m==0) 

R  *=  sqrt (n+ 1 ) ; 
else  if  (odd  ==  1) 

R  *=  sqrt (2* (n  +  1) )* sin (m* theta)  ; 
else 

R  *=  sqrt (2* (n  +  1) ) *  cos (m*theta)  ; 


> 

return  R; 

> 

void  zernikemode  ( int  P,  int  *n ,  int  *m) 

{ 

int  cntr=l,  reuse=0; 
int  n0  =  0 ,  mO  =0 ; 
while  (cntr  <P) 

{ 

nO  += 1 ; 

if  ((n0  -  2*(n0/2))  ==  0)  /*  if  n  is  even  start  at  m=0  */ 

i 
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188 


0; 


mO  =  0 ; 
reuse  = 
cntr  +  + ; 
if  (cntr  >=P) 

{  *n  =  nO ;  *m  =  mO ;  return;} 
while  (mO<nO  I  I  reuse=  =  l) 

{ 

if  ( r euse  == 1 ) 

{ 

reuse  =  0; 
cntr  ++ ; 
if  (cntr>=P) 

{  *n  =  nO ;  *m  =  mO ;  return;} 

} 

else 

{ 

reuse  =  1; 
mO  +=2 ; 
cntr  ++ ; 
if  (cntr>=P) 

{  *n  =  nO ;  *m  =  mO ;  return;} 

} 

} 

} 

else  /*  n  is  odd.  .  .  */ 

{ 

reuse  =  1; 
mO  =  1; 
cntr  +  + ; 
if  (cntr  >=P) 

{  *n  =  nO ;  *m  =  mO ;  return;} 
while  (mO<nO  I  I  reuse  ==1) 

{ 

if  (reuse  == 1 ) 

{ 

reuse  =  0; 
cntr  ++ ; 
if  (cntr>=P) 

{  *n  =  nO ;  *m  =  mO ;  return;} 

} 

else 

{ 

reuse  =  1; 
mO  +  =  2 ; 
cntr  ++ ; 
if  (cntr>=P) 

{  *n  =  nO ;  *m  =  mO ;  return;} 

} 

} 

} 

} 

*n  =  nO ; 

*m  =  mO ; 

} 


void  zernike_gr id ( double  *arry ,  int  *mask ,  int  P, 

{ 


int  cO,cl;  //  counters 

double  dx ;  //  point  spacing  along  the  grid 


double  x , y , r , thet a=0 ; 
double  pi  =  M_PI ; 
double  *tmp  =  arry ; 
int  *tmp2=mask; 


//  coordinate  variables 
//  shorthand  for  pi 
//  output  array 
//  mask  array 


int  m) 


dx  =  ( double ) 2/ (m-1) ;  //  dx 
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193 

198 

203 

208 

213 

218 

223 

228 

233 

238 

243 

248 

253 


x 


y  = 

for 

{ 


-1; 

-l; 

(c0=0;  cO<m;  c0++) 


for  (cl=0;  cl<m;  cl+  +  ) 

{ 

r  =  sqrt(x*x  +  y*y)  ;  //  get  the  radius 

if  (r >1) 

{ 

*tmp  =  0;  //  zero  outside  unit-circle 

*tmp2  =  0;  //  make  the  mask 

} 


else 

{ 

*tmp2  =  1;  //  make  the  mask 

//  figure  out  what  quadrant  we  are  in 
if  (x  ==  0)  //  avoid  divide  by  zero 
{ 

if  (y  >  =  0) 

theta  =  pi/2; 
else  if  (y<0) 

theta  =  3*pi/2; 

> 

else 

if  (x>0)  //  take  care  of  1st  &  J^th  quadrants 

theta  =  atan(y/x); 

else  //  second  and  third  quadrant 

theta  =  pi  +  atan(y/x); 

*tmp  =  zernike.val (r ,  theta,  P) ;  //  value  inside 

} 

x  +=  dx ;  //  increment  x 

tmp++;  //  increment  grid  pointer 

tmp2++;  //  increment  mask  pointer 

> 


x  =  -1;  //  reset  x 

y+=dx;  //  increment  y 


circle 


void  masked.zs creen ( double  *arry ,  double  *mask ,  int  P,  int  m) 

{ 


int  cO 
double 
double 
double 
double 


cl  ;  //  counters 

dx ;  //  point  spacing  along  the  grid 


x , y , r , thet a=0 ; 
pi  =  M_PI  ; 

*  tmp  =  arry ; 


double  * tmp2=mask ; 


//  coordinate  variables 
//  shorthand  for  pi 
//  output  array 
//  mask  array 


dx 


( double ) 2/ (m- 1 ) ;  //  dx 


x  =  -1; 

y  =  -i; 

for  (c0=0;  c0<m;  c0++) 

{ 

for  (cl=0;  cl<m;  cl++) 

{ 

r  =  sqrt(x*x  +  y*y)  ;  //  get  the  radius 

if  (*tmp2  =  =  0) 

*tmp  =  0;  //  zero  outside  unit-circle 

else 

{ 

//  figure  out  what  quadrant  we  are  in 
if  (x  ==  0)  //  avoid  divide  by  zero 
{ 

if  (y  >  =  0) 

theta  =  pi/2; 
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1st  &  4 th  quadrants 


> 


> 


else  if  (y<0) 

theta  =  3*pi/2; 

> 


else 

if  (x>0)  //  take  care  of 

theta  =  atan(y/x); 
else  //  second  and  third  quadrant 

theta  =  pi  +  atan(y/x); 

*tmp  =  * tmp2 * zernike_val (r ,  theta,  P);  //  value 

} 


x  +=  dx ; 
tmp  +  + ; 
tmp2  +  + ; 

> 

x  =  -1; 
y +=dx ; 


//  increment  x 
//  increment  grid  pointer 
//  increment  mask  pointer 

//  reset  x 
//  increment  y 


inside  circle 


void  zs creen ( double  *arry ,  int  P,  int  m) 

{ 

int  cO,cl;  //  counters 

double  dx ,  xmax ;  //  point  spacing  along  the  grid 

double  x , y , r , thet a=0 ;  //  coordinate  variables 

double  pi  =  M_PI ;  //  shorthand  for  pi 

double  *tmp  =  arry  ; 


xmax  =  2 . 0/ sqrt ( 2 . 0 ) ;  //  twice  the  max  x  or  y  value 
if  ( (m-2* (m/2) ) ==0)  //  for  an  even  sized  grid 

dx  =  xmax/m;  //  make  sure  we  hit  zero  at  the  right  spot 
else  //  for  an  odd- sized  grid 

dx  =  xmax/ (m-1) ; 

x  =  -1/ sqrt  (2)  ; 
y  =  x; 

for  (c0=0;  cO<m;  c0++) 

{ 

for  (cl=0;  cl<m;  cl++) 

{ 

r  =  sqrt (x*x  +  y*y)  ; 

//  figure  out  where  we  are  first 
if  (x  ==  0)  //  avoid  divide  by  zero 
{ 

if  (y  >  =  0) 

theta  =  pi/2; 
else  if  (y<0) 

theta  =  3*pi/2; 

} 

else  if  (x>0)  //  talke  care  of  1st  &  J^th  quadrants 

theta  =  atan(y/x); 

else  //  second  and  third  quadrant 

theta  =  pi  +  atan(y/x); 

*tmp  =  zernike_val (r ,  theta,  P) ; 
x  +=  dx ; 
tmp  +  + ; 

> 

x  =  -1/ sqrt  (2)  ; 
y+=dx ; 

> 

> 
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Appendix  B.  Simulation,  Data  Generation,  and  Data  Reduction  Code 


This  appendix  contains  the  code  sections  used  to  generate  simulated  data,  run  the 
APDI  algorithm,  and  post-process  the  results.  Matlab®  code  and  associated  mex  files 
were  compiled  using  Matlab®  R14SP3,  Gnu  make  and  gcc  3.4.6.  The  FFTW3  library  is 
required  to  build  several  of  the  mex  files. 


B.l  Simulation  Code 

Simulated  data  was  generated  using  Matlab®  .  The  uniform  and  normal  random 
number  generators  were  seeded  using  random  bits  read  from  the  Linux  /dev/ur andorn  device 
file,  or  alternatively  with  the  system  time. 

Listing  B.l:  writePDdata.m 

function  []  =  writePDdat a ( snr , f rms ,  det _ s igma , var argin ) 

^=========================================================================== 

X  Peter  Johnson 
4  X  10  Jul  2006 
X 

X  writePDdata  (snr , frms  ,  det_sigma)  generates  simulated  active  "phase  -  diversity 
X  data  and  a  starting  guess  for  the  APD  reconstruction  algorithm. 

X 

9  X  snr  is  the  overall  detection  SNR  encompasing  poton  and  gaussian 
X  read  noise ,  frms  is  the  number  of  indpendent  noise  realizations  to 
X  generate  ,  det_sigma  is  the  number  of  rms  noise  el  ectrons /read  for 
X  the  gaussian  read  noise. 

X 

14  X  writ  ePDdat  a  ( snr  ,  frms  ,  det _sigma ob  j  f l . mat  ’ )  will  generate  data  using  the 
X  object  contained  in  ob  j  fl . mat 

o/o=  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  = 


X  compute  the  average  number  of  phot o - el ectrons  required  to  get  the  desired 
19  X  snr.  This  comes  from  inverting  snr  =  k/ s qrt (k+ s i gma ~2) 
photons  =  max ( roots ( [-1/ snr ~2  1  det _ s igma ~ 2] ) ) ; 


X  randomize  the  random  number  generators  . . 

X  inlcude  system  time  in  case  / dev/urandom  is  depleted  of  entropy 
24  Rfile  =  f open (’ /dev/urandom ’, ’ r ’) ; 
r  =  f read (Rf ile  ,  2  ,  ’ uint32  ’  )  ; 
f close (Rfile)  ; 

rand (’state ’ ,sqrt(r(l)*sum(100*clock)))  ; 
randn ( ’state ’ ,sqrt (r(2)*sum(100*clock))) 

29 


X  pull  from  the  linux  random 
X  device  for  seeds,  will  only 
X  work  with  linux 
X  seed  uniform-rand 
X  seed  normal -rand 


X  load  the  test  object 
if  nargin>3 

sat  =  char ( var argin ( 1 )) ;  X  see  if  we  were  passed  a  truth  object 

else 

34  sat  =  ’ medsat . mat ’ ;  X  if  not,  load  the  default  object 

end 

load ( sat )  ; 

s  =  size ( ob j )  ; 

obj  =  obj /mean (mean ( obj ))  ; 

39 


X  set  up  some  constants 

modes  =  15;  X  number  of  zernike  modes  to  compensate  for 

calcmodes  =  200;  X  number  of  modes  to  blur  data  with 


166 


44 

49 

54 

59 

64 

69 

74 

79 

84 

89 

94 

99 

104 


D_rO  =  15;  °/B  D/rO  for  the  phase  screen 

div  =  2;  °/0  add  waves  of  defocus  to  the  div  channel 


°/o  get  indexes/make  arrays  for  clear  parts  of  aperture  for  the  pp-psd 
m  =  f loor (s ( 1) / sqrt  (2)  )  ; 
strt  =  f ix ( s ( 1 ) /2 -m/2)  ; 
ind  =  strt +1 : strt +m+l ; 

tmpl  =  zeros  (s);  °/B  working  array 

tmp2  =  zeros(m);  °/B  temp  array  for  clear  part  of  pupil 

tmp3  =  zeros  (s);  °/B  working  array 


X  make  working  arrays 
dta  =  zeros ( [s , 2* f rms ] ) ; 
wghts  =  zeros ( 1 , modes  * f rms )  ; 
calcwghts  =  zeros ( 1 , calcmodes ) ; 
psd  =  zeros ( s ) ; 


%  storage 
°/0  s  tor  age 
°/0  storage 
°/0  storage 


array  for  the  data. . . 
for  ab err  at  ion  parameters 
for  aberration  parameters 
for  averaged  speckle  psd 


ft  make  frames  of  data  and  put  them  in  the  output  array 
for  cO=l : f rms 


°/o  make  the  focal  and  diversity  phase  screen 

inds  =  [1 :  modes ]  +  ( cO -1) *modes ;  °/B  index  to  aberr  parameters 

°/0  make  the  FP  phase  screen 

[scrn , msk , calcwghts]  =  circle.phasescreen (D_rO ,  s(l),  calcmodes); 

wghts(inds)  =  c al cwght s ( 1 : mode s ) ;  X  save  the  aberr  parameters 

divwghts  =  calcwghts;  °/B  copy  aberr  for  div  screen 

divwghts  (3)  =  divwghts  (3)  +  div*pi  .  /  sqrt  (3)  ;  °/0  add  defocus 

°/o  build  up  the  DP  phase  screen 

scrn2  =  zeros(s); 

for  c 1 =3  :  length ( divwght s ) 

[tmp ,  msk]  =  zernike_gr id ( cl +  1 , s  ( 1) )  ; 
scrn2  =  scrn2  +  di vwght s ( c 1 ) * tmp ; 
end 


°/o  get  the  pupil-plane  field 
fid  =  f ields im  (  ob j )  ; 


°/0  get  the  PP  intensity ,  add  noise ,  throw  out 
I_pp  =  abssq  (  f  Id  (  ind  ,  ind)  )  ;  °/B  get  the 
if  snr<100 

I_pp  =  poissrnd ( I_pp*photons ) ; 

I_pp  =  I_pp  +  det_sigma*randn ( size ( I_pp) ) ; 
I_pp  =  I _pp /phot ons ; 

I_pp  (  f  ind  (  I_pp  <0)  )  =  0; 
end 


negatives  and  normalize 
field  inside  the  aperture 

°/0  photon  noise 
X  read  noise 
°/0  normalize 
°/0  kill  negative 


°/0  compute  and  average  the  pupil-plane  speckle  psd 
tmp2  =  f f tshif t (f f t2 ( spec_psd ( I_pp) ) ) ; 
tmp3 ( ind , ind )  =  tmp2  ; 

psd  =  psd  +  real ( if ft2 ( if f tshif t (tmp3) )) ; 


°/0  make  the  in-focus  data  frames ,  add  noise  and  normalize 
I_fp  =  prod (s) *abssq ( if f t2 (msk . *f Id . *exp ( j *scrn) ) ) ; 
if  snr<100 

I_fp  =  poissrnd  (  I_fp*photons )  ;  °/0  photon  noise 

I_fp  =  I_fp  +  det_sigma*randn  ( size  ( I_f  p)  )  ;  °/B  read  noise 

I_fp  =  I_fp/photons  ;  °/B  normalize 

I_f  p  (  f  ind  (  I_f  p  <0)  )  =0;  °/B  kill  negative 

end 

dt a  (  :  ,  :  ,  2* cO - 1 )  =  I_fp; 


°/B  make  the  diversity  data  frames,  add  noise  and  normailze 
I_dp  =  pr  od  (  s  )  *  abssq  (  if  f  1 2  (msk  .  *  f  Id  .  *  exp  (  j  *  scrn2  )  )  )  ; 
if  snr<100 

I_dp  =  poissrnd  ( I_dp*photons )  ;  °/B  photon  noise 

I_dp  =  I_dp  +  det  _sigma*randn  ( size  ( I_dp)  )  ;  °/B  read  noise 
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I_dp  =  I_dp/photons ; 

I_dp  (  f  ind  (  I_dp  <0)  )  =  0; 
end 

dt  a  (  :  ,  :  ,  2*  cO  )  =  I_dp; 
end 

X  make  an  initial  guess  array  .  .  . 
guess_obj  =  .05*ones(s); 

guess_aber  =  0 . 5* randn ( 1 , length ( wght s )) ; 
for  c0=0 : f rms -1 

guess_aber (modes *cO+l : modes*c0+2)  =  0; 
end 

X  open  the  output  files 

guess  =  f open (’ guess . dat ’, 5 w ’,’ native ’) ; 
data  =  f open ( ’ data . dat ’  , ’ w 5 , * nat ive ’ ) ; 
truth  =  f open (’ truth . dat ’,’ w ’,’ nat ive ’) ; 

X  The  data.dat  file  contains  the  data  in  the  foilwing  configuration  the  first 
X  element  is  m,  the  dimension  of  the  array,  the  second  is  f,  the  number  of 
X  frames  of  data,  the  third  is  the  number  of  aberration  parameters,  k,  used 
X  to  generate  the  phase  screen,  the  fourth  is  the  number  of  waves  of  defocus 
X  used  to  generate  the  diversity  images.  The  remainder  of  the  file  contains 
X  the  data  values  in  the  following  format...  The  contiguous  block  of  f*m~2 
X  values  are  the  focal  and  diversity  intensities  ,  and  the  final  m~2  values 
X  are  the  averaged  delta-removed  and  unit-average  normalized  pupil-plane 
X  speckle  psd . 

X 

°/0  The  contiguous  block  of  focal  and  diversity  plane  data  are  broken  into  2m~2 
X  blocks,  each  block  containing  the  data  from  one  realization/ frame .  The 

°/0  first  m~2  values  are  the  focal-plane  data  for  this  frame  and  the  second  m~2 
ft  values  are  the  diversity  plane  data. 

X 

X  The  guess  and  truth  data  files  are  identical  with  the  exception  that  the 
X  truth  data  file  contains  the  true  object  and  aberration,  and  the  guess  file 
X  contains  the  initial  guess.  The  first  element  in  the  guess  file  is  m,  the 
X  dimension  of  the  object,  the  second  element  is  the  number  of  frames  f,  the 
X  third  the  number  of  ab  err  at  ion  parameters  k,  the  next  m~2  elements  are  the 
X  object  data  values,  and  the  last  f *k  elements  are  the  aberration  parameters. 


X  write  the 

initial 

guess  file... 

m  =  s  (  1 )  ; 

f write (guess 

,  m ,  ’ double ’ ) ; 

f write ( guess 

,  f rms  , 

J  double  5 )  ; 

f write (guess 

,  modes 

,  *  double  , )  ; 

f write (guess 

,  D_rO  , 

J  double  * )  ; 

f write ( guess 

,  div  , 

*  double  ’  )  ; 

cnt  =  f write (guess  , 

guess_obj  ,  ’double’ 

cnt  =  f write (guess , 

guess_aber ,  ’double 

X  write  the 

truth  data  file . . . 

f write (truth 

,  m ,  ’ double  ’  )  ; 

f wr it e (truth 

,  f rms  , 

’ double  ’  )  ; 

f write (truth 

,  modes 

,  ’ double  ’ )  ; 

f write (truth 

,  D_rO  , 

’ double  ’  )  ; 

f wr it e (truth 

,  div  , 

’ double  ’ )  ; 

cnt  =  f wr it e ( truth  , 

ob j  ,  ’ double  ’  )  ; 

cnt  =  f write (truth  , 

wghts  ,  ’ double  ’  )  ; 

X  write  the 

data  file... 

f wr it e ( data , 

m ,  ’ double  ’ )  ; 

f wr it e ( data , 

frms  , 

’ double  ’  )  ; 

f wr it e ( data , 

modes  , 

’ double  ’  )  ; 

f wr it e ( data , 

D_rO  , 

’ double  ’ )  ; 

f write ( data , 

div  ,  ’ 

double  ’  )  ; 

cnt  =  fwrite(data, 

dta ,  ’ double  ’  )  ; 

X  normalize 
X  kill  negative 
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cnt  =  cnt +f writ e ( data ,  psd ,  ’double’); 

°/0  output  status  and  close  the  files 

f  printf  (  ’ \nwrote  °/0d  °/#dx70d  frames  using  %d  modes \n  ’,  frms  ,  m  ,  m  ,  modes  )  ; 
f  close ( ’ all  ’ )  ; 


Listing  B.2:  circle_phasescreen.m 

function  [screen,  mask,  X]  =  circle.phasescreen (D_rO , n , k) 
^=========================================================================== 

°/0  Peter  Johnson 
%  7  Mar  2006 
% 

°/0  [ screen ,  mask,  X ]  =  circl  e_phas  es  creen  (D_rO  ,  n ,  k) 

X 

X  Generate  an  nxn  tilt -removed  circularly - ob scured  phase  screen  for  a  D/r_0 
°/0  ratio  of  D_rO  using  the  first  k  Zernike  polynomials  ( the  first  3  are 
°/0  actually  ingored,  but  counted  anyway...)  and  the  covariance  (Noll)  matrix 
X  for  the  Zernike  coefficients 

Yo 

X  the  weights  used  for  the  zernike  modes  are  returned  in  the  vector  X.  The 
Yo  first  two  elements  are  ignored  as  tip  and  tilt.  Piston  is  not  calculated 
Yo  nor  carried  along  as  a  term... 

o/o=  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  =  = 


Yo  covariance  matrix  for  the  zerike  coefficients 


K  =  D_rO ~ ( 5/3 ) * nollmat r ix  (k)  : 
K  =  chol (K)  >  ; 

X  =  randn (k  ,  1 )  ; 

X  =  K*X  ; 


X (1 : 2) 


0; 


Yo  make  the  noil-matrix  for  modes  2 
Yo  Cholseky  Factorize  K 
Yo  Generate  gaussian  random  numbers 
Yo  "color"  them  with  K 
Yo  throw  out  tip/tilt 


=>  k  +  1 


Yo  now  buld  up  the  phase-screen  and  mask 
screen  =  zeros(n); 
for  cO=l : k 

if  (X(c0)~=0) 

[tmp , mask]  =  zernike_gr id ( cO+1 , n) ; 
screen  =  screen  +  X(cO)*tmp; 

end 

end 


return 


Listing  B.3:  fieldsim.m 

function  obj  =  f ields im ( ob j ) ; 

Yo  Peter  Johnson 
Yo  16  Nov  2005 

Yo  obj  =  f  i  el  dsim  ( ob  j ) ;  Uses  AFRL  gausspec  style  speckle  simulation  to 
Yo  generaterate  the  complex  fields  in  the  pupil  plane  assuming  an  optically 

Yo  rough  object.  The  object  array  must  be  square  for  the  scaling  to  be 

Yo  correct  .  The  resulting  field  should  have  unit-average  intensity . 

Yo 

Yo  It  uses  the  object  to  produce  circular  complex  RV  ’  s  with  statistics 
Yo  consistent  with  the  object,  then  propagates  the  RV  ’  s  to  the  far- field  to 
Yo  generate  the  pupil-plane  complex  field.  A  speckle  realization  is  found 
Yo  by  taking  the  square  modulus  of  the  field.  This  implementation  is  set  to 

Yo  give  a  unit-average  intenisy  in  the  pupil  plane. 

Yo 

Yo  This  is  faster,  and  agrees  better  with  theory  than  the  original  upsample 
Yo  ~  add  phase  -  propagate  -  down  select  method  I  used  before.  Definitely  the 
Yo  better  option . 

obj  =  obj /mean (mean ( obj )) ;  Yo  scale  to  unit  average  intensity 
sigma  =  sqrt(obj/2);  Yo  standard  deviation  for  RV  *  s 

s  =  size(sigma);  Yo  size  of  the  object  array 
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rl  =  s igma . * r andn ( s ) ; 
im  =  s igma . * r andn ( s ) ; 
obj  =  rl  +  i*im; 


'/,  make  circular  complex  gaussian  RV  ’  s 


obj  =  f  f  t  shif  t  (  f  f  1 2  (  ob  j  )  )  ;  '/,  propagate  to  far-field. 


obj  =  ob  j  .  /  s  (  1 )  ; 
return 


7  re-scale  it  to  unit  average  intensity  at  this  point 
7,  this  is  a  result  of  MATLAB  ’  s  implementation  of  FFT  ’  s 


Listing  B.4:  Makefile  for  Matlab®  MEX  files 

#  set  up  variables 
HOME=~pjohnson 
CC=gcc 

MATLAB  =/ apps / Linux 8 6 /mat lab 14sp3 

MEX =$ (MATLAB) /bin /mex 

MATLAB INC =-I$ (MATLAB) /extern /include 

MATL IBS =$ (MATLAB) /extern /lib 

CCFLAGS  =  - 03  -march  =  i686  -fPIC  -Wall 

LIBPATH= 

SEARCHPATH=-I$ (HOME) /lib/  - 1$ ( HOME ) / lib /header s 
INCLUDE=- If f tw3  -lm 
MATLIB= . /matlab 
MYLIB=mylibs . o 

#  the  mex  files  that  can  be  built / cleaned/ etc  .  .  . 

MEXFLS= spec_psd . mexglx  zernike.gr id . mexglx  nollmat r ix . mexglx \ 
circle.spec.psd .mexglx 

MXOB JS=psdmex . o  zernike.gr idmex . o  nollmat rixmex . o  circle.spec.psdmex . o 
OB JECTS=zernikes . o  mylibs.o 

#  options  for  the  different  kinds  of  files 

MEX0PTS=$ (LIBPATH)  $ ( SEARCHPATH )  $(INCLUDE)  -output 
00PTS=-c  $ ( SEARCHPATH )  $(CCFLAGS)  $ ( MATLABINC ) 

#  rule  to  make  all  the  mex  files 
.PHONY:  mex 

mex:  $(MEXFLS) 

mv  *. mexglx  ./matlab/ 

#  rule  to  make  all  object  files 
.PHONY:  obj  s 

obj  s :  $ ( OBJECTS ) 

#  stuff  for  making  mylibs.o 
mylibs.o:  mylibs . c 

$(CC)  -c  mylibs. c  -o  $ ( MYLIB )  $(00PTS) 

#  stuff  for  making  a  circular-aperture  zernike  mode 
ZGRIDOB=zernikes . o  $(MYLIB)  zernike.gridmex . o 
zernike.gr id . mexglx :  $(ZGRID0B) 

$ ( MEX )  —  c xx  $ ( ZGRIDOB )  $(MEX0PTS)  zernike_grid 
zernike.gridmex . o :  $( MATLIB ) /zernike.gridmex . c  zernikes.o 

$(CC)  $ (MATLIB) /zernike.gridmex . c  $(00PTS) 

#  stuff  for  making  the  psd  functions 
PSDOB J =psdmex . o  mylibs.o 

spec.psd . mexglx :  $(PSD0BJ) 

$ ( MEX )  —  c xx  $ (PSDOBJ )  $(MEX0PTS)  spec.psd 
psdmex . o :  $( MATLIB ) /psdmex . c 

$(CC)  $ ( 00PTS )  $ (MATLIB) /psdmex . c 

#  stuff  for  making  the  circl e_sp  ec_p  s  d  mex f unction 
CRCPSDOBJ=circle_spec_psdmex . o  mylibs . o 
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circle_spec_psd.mexglx:  $(CRCPSDOBJ) 

$ ( MEX )  -cxx  $ ( CRCPSDDB J )  $(MEXOPTS)  circle_spec_psd 
circle_spec_psdmex.o:  $(MATLIB)/circle_spec_psdmex.c 
$(CC)  $(OOPTS)  $ ( MATLIB ) / c ir c le_spec _psdmex . c 

#  stuff  for  making  the  noli  -  covariance  matrix 
NOLLOB J =no llmat r ixmex . o  zernikes . o  mylibs.o 
nollmatrix . mexglx :  nollmatrixmex . o 

$ ( MEX )  -cxx  $ ( NOLLQB J )  $(MEX0PTS)  nollmatrix 
nollmatrixmex .o:  $(MATLIB)/nollmatrixmex.c 

$(CC)  $(00PTS)  -std=c99  $ (MATLIB ) /nollmatrixmex . c 

#  clean  up  stuff 
.PHONY  :  clean 
clean : 

rm  $ ( OBJECTS )  $(MEX0BJS) 
cd  matlab ;  rm  $(MEXFLS) 

Listing  B. 5:  zernike_gridmex. c 

/*==================================================================== 

*  Wrapper  function  for  matlab  to  run  subroutines  written  in  C.  */ 

#include  "mex.h" 

#def ine  LOCAL 
#include  "mylibs.h" 

#include  " zernikes . h" 

void  mexFunc t i on ( int  nlhs  , mxArr ay  *plhs  []  ,  int  nrhs , const  mxArray  *prhs  []  ) 

{ 

double  *arry ,  *mask ,  *m  ,  *P; 
int  mi  ,  p  ,  *msk  ,  cO  ; 

/*  check  for  complex  input  field  */ 
if  (mxIsComplex (prhs  [0] ) ) 

mexErrMsgTxt (" Input  must  be  real"); 

/*  error  check  the  number  of  input  and  output  arguments  */ 
if  (nrhs  !=  2) 

mexErrMsgTxt ("need  2  input  args " ) ; 
if  (nlhs  !=  2) 

mexErrMsgTxt ("Must  have  2  output  args"); 

/*  get  a  pointer  to  the  input  arrays  */ 

P  =  mxGetPr ( prhs [0] ) ; 
m  =  mxGetPr (prhs  [1] )  ; 
mi  =  *m; 
p  =  *P  ; 

/*  make  the  mask  array  */ 

msk  =  malloc (mi *mi * sizeof ( int ) )  ; 

/*  make  the  output  arrays  to  go  back  to  matlab  and  set  the  pointer  */ 

plhs [0]  =  mxCreat eDoubleMatr ix (mi , mi , mxREAL) ; 

plhs  [1]  =  mxCreat eDoubleMatr ix (mi , mi , mxREAL)  ; 

arry  =  mxGet Pr ( plhs  [0] )  ; 

mask  =  mxGetPr (plhs  [1] )  ; 

/*  call  the  subroutine  */ 
zernike_gr id ( arry ,  msk,  p,  mi); 

/■ *  copy  the  mask  to  the  matlab  array  */ 
for  (c0=0;  c0<mi*mi;  c0++) 

*(mask+c0)  =  *(msk+c0); 

free (msk) ; 
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return ; 


> 


Listing  B. 6:  psdmex.c 

/*==================================================================== 

*  Wrapper  function  for  matlab  to  run  subroutines  written  in  C  .  */ 

#def ine  LOCAL 
#include  "mylibs.h" 

#include  "mex.h" 

void  psdtest ( double  *  array ,  double  *rslt  ,  int  n)  ; 

void  mexFunc t i on ( int  nlhs , mxArr ay  *plhs  []  ,  int  nrhs , const  mxArray  *prhs  []  ) 

{ 

int  n,m; 

double  *speckle,  *psd; 

/*  check  for  complex  input  field  */ 
if  (mxl sComplex (prhs  [0] ) ) 

mexErrMsgTxt (" Input  speckle  field  must  be  real"); 

/*  error  check  the  number  of  input  and  output  arguments  */ 
if  (nrhs  !=  1) 

mexErrMsgTxt (" Only  one  input  is  allowed"); 
if  (nlhs  !=1) 

mexErrMsgTxt ("Must  have  one  output  argument"); 

/*  get  the  dimension  of  the  field  */ 
n  =  mxGetN ( prhs  [0] )  ; 
m  =  mxGetM ( prhs  [0] )  ; 
if  (n  !  =  m) 

mexErrMsgTxt (" Input  must  be  square"); 

/*  get  a  pointer  to  the  input  array  */ 
speckle  =  mxGetPr ( prhs  [0] )  ; 

/ *  make  the  output  array  to  go  back  to  matlab  and  set  the  pointer  */ 
plhs  [0]  =  mxCreat eDoubleMatr ix (m , n , mxREAL )  ; 

psd  =  mxGetPr ( plhs  [0] )  ; 

/*  call  the  psd  subroutine  */ 
spec_psd(speckle ,psd,n) ; 

return ; 


> 


Listing  B. 7:  circle_spec_psdmex .  c 

/*==================================================================== 

*  Wrapper  function  for  matlab  to  run  subroutines  written  in  C. 

* 

*  this  mex  file  is  for  computing  the  delta-removed  speckle  psd  for  the  case 

*  when  the  pupil  is  circular.  This  is  done  by  using  the  largest  inscribed 

*  rectangle  of  data  available  in  the  pupil,  computing  the  delta-removed  psd, 

*  then  interpolating  using  an  f ft - zerop ad - i f f t  approach. 

*/ 

#def ine  LOCAL 
#include  "mylibs.h" 

#include  "mex.h" 

void  psdtest ( double  *array,  double  *rslt  ,  int  n) ; 

void  mexFunct ion ( int  nlhs, mxArray  *plhs  []  , int  nrhs  , const  mxArray  *prhs  [] ) 

{ 
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int  n  ,  m  ; 

double  *speckle,  *psd; 

/*  check  for  complex  input  field  */ 
if  (mxIsComplex (prhs  [0] ) ) 

mexErrMsgTxt (" Input  speckle  field  must  be  real"); 

/*  error  check  the  number  of  input  and  output  arguments  */ 
if  (nrhs  !=  1) 

mexErrMsgTxt (" Only  one  input  is  allowed"); 
if  (nlhs  !=1) 

mexErrMsgTxt ("Must  have  one  output  argument"); 

/*  get  the  dimension  of  the  field  */ 
n  =  mxGetN ( prhs  [0] )  ; 
m  =  mxGetM ( prhs  [0] )  ; 
if  (n  !  =  m) 

mexErrMsgTxt (" Input  must  be  square"); 

/*  get  a  pointer  to  the  input  array  */ 
speckle  =  mxGetPr ( prhs  [0] )  ; 

/*  make  the  output  array  to  go  back  to  matlab  and  set  the  pointer  */ 
plhs  [0]  =  mxCreat eDoubleMatr ix (m , n , mxREAL )  ; 

psd  =  mxGetPr ( plhs  [0] )  ; 

/*  call  the  psd  subroutine  */ 
circle_spec_psd ( speckle ,psd,n) ; 

return ; 


> 


Listing  B.8:  nollmatrixmex .  c 

/*========================================================================== 

*  Peter  Johnson 

*  9  March  2006 

* 

*  Me x  function  for  computing  the  noil  -  covariance  matrix  for  atmospheric 

*  turbulence.  This  is  approximately  55  times  faster  than  the  matlab  code 

*  Pete  Crabtree  wrote ,  and  gives  the  same  results  to  within  machine 

*  precision . 

* 

*  Calling  Sequence: 

*  M  =  no l Imatrix (n) 

* 

*  this  will  produce  the  nxn  noil - cov ari ance  matrix,  which  must  be  scaled  by 

*  (D/rO )  ~  (5/3)  before  it  represents  the  covariance  of  zernike  terms  in 

*  atmospheric  turbulence. 

*/ 

# if ndef  M_PI 

#def ine  M_PI  3.14159265358979323846  /*  pi  */ 

#endif 

#include  "mex.h" 

#def ine  LOCAL 
#include  "mylibs.h" 

#include  "zernikes.h" 

double  noll_covar iance ( int  n,  int  m) ; 

/*  C  =  no 1 1_ cov ari ance  ( int  n,  int  m) ;  compute  the  zernike -mode  covariance 

*  between  the  n ’ th  and  m 1 th  Noll-indexed  zernike  modes  and  assuming 

*  Kolmogorov  turbulence.  This  can  be  used  to  build  the  covariance  matrix, 

*  which  in  turn  can  be  used  to  generate  phase  screens.  To  get  the  true 

*  covariance ,  this  needs  to  be  multiplied  by  (D/rO)  ~ (5/3) . 
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* 

*  inputs : 

*  int  n  ==  noil-index  of  the  first  mode 

*  int  m  ==  noil-index  of  the  second  mode 

* 

*  output : 

*  double  C  ==  covariance  of  the  requested  modes 
*/ 


void  mexFunc t i on ( int  nlhs , mxArr ay  *plhs  []  , int  nrhs , const  mxArray  *prhs  []  )  ; 
/*  mex  wrapper  function  */ 


double  noll_covar iance ( int  i,  int  j) 

{ 

double  C,  A,  B,  D;  //  working  doubles 

int  ni ,  mi,  nj  ,  mj ;  //  radial  and  angular  orders  for  the  modes 

int  tmp  ,  one  ; 

//  compute  the  radial  and  angular  mode  numbers 
zernikemode  ( i  ,  &ni  ,  &mi); 
zernikemode  ( j  ,  &nj  ,  &mj  )  ; 

//  see  if  we  even  need  to  bother 
if  (  (  (  i- j  )  °/02  =  =  O)  &&  (mi=  =  mj  )  ) 

//  get  the  exponent  on  the  negative  1  and  decide  if  -l~m  is  negative 
tmp  =  (ni  +  nj  -  2*mi)/2; 
if  (tmp°/02  ==  0) 
one  =  1 ; 
else 

one  =  -1; 

A  =  0 . 0072* one  * sqrt (( ni  +  1 )*( nj + 1 )) *pow (M_PI  ,( double ) 8/3)  ; 

B  =  tgamma (( double ) 14/3) * tgamma (( ni +nj -( double ) 5/3) /2) ; 

D  =  tgamma ( (ni -nj +( double ) 17/3) /2) * tgamma (( nj -ni +( double ) 17/3) /2)  * 
tgamma ((ni+nj+(double)23/3)/2) ; 

C  =  A*B/D ; 

> 

else 

C  =  0; 
return  C; 


void  mexFunc t i on ( int  nlhs, mxArray  *plhs  []  , int  nrhs , const  mxArray  *prhs  []  ) 

{ 

int  n  ,  cO ,  cl  ; 
double  C,  *M; 

/*  check  for  complex  input  data  */ 
if  (mxIsComplex (prhs  [0] ) ) 

mexErrMsgTxt (" Input  must  be  real"); 

/*  error  check  the  number  of  input  and  output  arguments  */ 
if  (nrhs  !=  1) 

mexErrMsgTxt ("need  1  input  arg  :  number  of  modes"); 
if  (nlhs  !=  1) 

mexErrMsgTxt ("need  1  output  args :  noil-matrix") ; 

/*  figure  out  how  many  modes  we  are  processing  */ 
n  =  * (mxGetPr ( prhs  [0] ) )  ; 

/■ *  make  the  output  array  to  go  back  to  matlab  and  set  the  pointers  */ 
plhs  [0]  =  mxCreat eDoubleMatr ix (n , n , mxREAL )  ; 

M  =  mxGetPr ( plhs  [0] )  ; 
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/*  build  up  the  noli  covariance  matrix  */ 
for  (c0=0;  cO<n;  c0++) 

{ 

for(cl=cO;  cl<n;  cl++) 

103  { 

C  =  no 11_ co var i anc e ( cO+2 , c 1 +2) ; 

* (M+ c 1 +n* cO )  =  C; 
if  (cO  !=  cl) 

* ( M  +  cO  +  n*  c 1 )  =  C; 

108  } 

> 


return ; 

113 

} 


B.2  Data  Generation  and  Reduction  Code 


This  section  contains  code  used  to  generate  and  post-process  the  data  used  for  the 
SNR  vs.  frames  analysis  shown  in  Fig.  6.22.  The  snr_f rms.test  and  batchsub.sh  scripts 
are  typical  of  the  scripts  used  to  run  all  of  the  scenarios  described  in  chapter  VI.  The  data 
generation  scripts  were  run  on  a  64  node  dual  Opteron  248  Linux  cluster  and  a  45  node 
Athlon  3000+  Linux  cluster  using  the  PBS  batch  processing  system.  Additional  runs  were 
done  on  a  dual  2.8  GHz  Pentium  4  Linux  workstation  and  a  dual  3.02  GHz  Pentium  4  Linux 
workstation  using  screen  and  bash  scripting. 

Listing  B.9:  snr_f rms.test  for  bash  or  PBS 

# ! /bin/b ash 
#PBS  -m  ae 

#PBS  -M  peter.johnson@afit.edu 

#PBS  -l  no  des  =1  :  ppn=l ,  wal  1 1  ime  =4.0000 :  00 

5  #PBS  -S  /bin/bash 

#PBS  -r  n 

#PBS  -V 
#PBS  -e  err 

#PBS  -0  out 

10  #  this  file  is  for  running  the  APDI  alg  on  a  given  dataset  for  several 

#  conditioning  bias  levels  for  comparison.  The  resulting  output  files  are 

#  gzip  3  d  together  and  stored  in  a  directory  for  later  analysis 

#  unset  the  DISPLAY  env  var 
15  unset  DISPLAY; 


#  set  up  some  other  variables 

export  H0ME=/home/afit6/ engphd07 /pjohnson 

export  PR0C=cuname  -p 1 ;  #  decide  what  arch  we  are  running  on... 

20  DATE=‘date  +0/#G#/Om0/Od 1  ;  #  date  the  script  was  run  for  filename 


#  see  if  the  number  of  iterations 
if  [  !  -n  " $ ITERS "  ];  then 
ITERS  =  100 ; 

25  fi 


was  defined . . . 

#  number  of  realizations 


to  generate 
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#  check  to  see  that  we  got  snr , 
if  [  -n  "$SNR"  ];  then 
export  SNR 

else 

export  SNR=7 ; 
f  i 

if  [  -n  " $FRM "  ];  then 
export  FRM ; 

else 

export  FRM=30; 
f  i 

if  [  -n  "  $SGMA "  ];  then 
export  SIGMA 

else 

export  SGMA=6; 
f  i 


frms,  and  sigma  from  the  environment 

#  get  the  SNR  from  the  caller 

#  otherwise  set  a  default  SNR 

#  get  number  of  frames  from  caller 

#  otherwise  set  a  default  number  of  frms 

#  get  the  detector  sigma  from  the  caller 

#  otherwise  set  a  default  detector  Sigma 


#  set  the  directory  to  save  the  results  in 

BASEDIR=$HOME/dynamicaberr_data/snr_frms_test 
SAVEDIR=$BASEDIR/${SNR}_snr/${FRM}f rm/${SGMA> sgma 


# 

make  sure  the  directory  is  there  to  save 

stuff 

if 

[  !  -d 

$BASEDIR/${SNR}_snr  ] ;  then 

mkdir 

$BASEDIR/${SNR}_snr 

f  i 

if 

[  !  -d 

$BASEDIR/${SNR}_snr/${FRM}frm  ]; 

then 

mkdir 

$BASEDIR/${SNR}_snr/${FRM}frm 

f  i 

if 

[  !  -d 

$SAVEDIR  ] ;  then 

mkdir 

$SAVEDIR 

f  i 

in 


#  figure  out  which  directory  to  run  in  from  the  environment  ,  devault  to  1  if 

#  not  set.  This  is  so  we  don’t  overwrite  files... 
if  [  -n  " $W0RKDIR "  ];  then 

WORK=$BASEDIR/ runtmp/ $W0RKDIR 

else 

W0RKDIR=1 

WORK =$BASEDIR/ runtmp /$W0RKDIR 
f  i 

#  make  the  directory  if  it  isn’t  there 
if  [  !  -d  $WORK  ] ;  then 

mkdir  $WORK ; 
f  i 

#  change  directory  to  the  working  directory 
cd  $WORK  ; 

#  set  up  architecture  dep  endent  stuff .  .  . 
if  [[  $PROC  =  x86_64  ]];  then 

MATLAB= "/ apps / Linux86_64 /mat Iabl4sp3 /bin/mat lab  -nodisplay  >&  /dev/null" 

APDI=$H0ME/bin/APDIsolve_64 

echo  "64-bit  Processor" 

else 

MATLAB  =  "/apps/Linux86/matlabl4sp3/bin/matlab  -nodisplay  >&  /dev/null" 
if  [[  $PR0C  ==  athlon  ]];  then 

APDI=$HOME/bin/ APDI solve. at hlon 
echo  "AMD  Processor" 

else 

APDI=$HOME/bin/APDIsolve 
echo  "Intel  Processor" 
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f  i 
f  i 

#  loop  over  the  iterations 
for  ( ( i  =  0 ;  i<$ITERS;  i+  +  ));  do 


#  call  matlab  to  write  the  data  files  ..  . 

$MATLAB  <<  EOF 

proc  =  get env  (  *  PROC  ’  )  ;  7,  get  the  arch 

home  =  get  env  (’  HOME  *  )  ;  7,  find  home 

if  strcmp (proc  ,  ’ x86_64  ’  )  °/0  set  proc  depend  stuff 

addpath ( genpath ( str cat (home ,  ’ / lib64  ’))) ; 
fprintf (,\nx86_64\n,) ; 
else 

addpath(genpath(strcat(home ,  ’/lib’))) ; 

fprintf (’\ni686\n5) ; 

end 

rand  (’  st ate  sum  ( 100*  clock ))  ;  /  initialize  rand  number  generator 

randn (’ state  sum ( 100* clock ))  ;  /  same  for  gaussian  random  numbers 
snr  =  str2num  ( get  env  (  ’  SNR  ’  )  )  ;  1  get  the  SNR 

frms  =  str2num  ( get  env  (  ’  FRM  ’  )  )  ;  °/  get  #  of  frames 

det_sigma  =  str2num  (  get  env  (  ’  SGMA  *  )  )  ;  7,  gaussian  noise  std 

writePDdata (snr , frms , det_sigma) ;  7#  generate  data  and  write  output  files 

7«  remove  the  unneeded  guess  data 
!  rm  guess . dat 

exit  ; 

EOF 

wait 

#  call  the  APDI  algorithm 
$  APDI 

wait 

#  set  the  filename  for  this  realization 

FILENAME  =  ${DATE} _$ { WORKDIR} _$ { SNR} snr_$ {FRM} f  rms_$ { i } . tgz 

#  tar-gzip  the  data  files  together  and  move  them 
tar  -czf  $FILENAME  * . dat 

rm  * . dat 

mv  $FILENAME  $SAVEDIR 


done 


Listing  B.  10:  batchsub.sh 

# ! /bin/bash 

#  Peter  Johnson 

#  this  launches  a  bunch  of  runs  to  the  pbs  server  for  queing 

SNRVALS  =  (  2  3  5  7  10  15  20  ); 

snr len=7 ; 

FRMVALS  =  (  10  20  30  40  50  )  ; 

f  rmlen  =  5 

export  SGMA=100; 

for  ( ( i=0 ; i<$snrlen ;  i++));  do 
export  SNR=${ SNRVALS [${i}] } 
for  (( j =0 ; j < $f rmlen ;  j++));  do 
export  FRM=${ FRMVALS [${ j }] } 
export  W0RKDIR=${i}${ j } 
qsub  snr_f rms_t est 
wait 

done 

done 


177 


1 

6 

11 

16 

21 

26 

31 

36 

41 

46 

51 

56 

61 


ListingB.il:  datareduce  .m 


°/0  Peter  Johnson 
%  23  May  2006 
°/t  datareduce  .  m 

X 

X  this  file  cycles  through  the  results  directory  and  grinds  up  the  data  to 
°/0  get  the  MSE,  mean  MSE,  and  s  t  andar  d_  dev  i  at  i  on 

clear;  clc;  close  all  hidden; 

°/0  set  this  to  to  save  the  graphics  to  files... 
prnt  =  l;  °/0  set  to  1  to  save  individual  pics 

subfgs  =  0;  °/0  set  to  0  to  have  all  the  figures  in  individual  windows 


°/o  set  up  parameters 

fname  =  ’ 64x64_mse . csv  ’  ;  X  file. 

SNR  =  {’2’,  ’  3  ’  ,  ’  5  ’  ,  ’7’,  >10’,  ’15 

snrval  =  [2,3,5,7,10,15,20]; 

frms  =  {’10’,  5  20  5  ,  5  30  5  ,  5  40  ’  ,  ’50’ 

frmval  =  [10,20,30,40,50]; 

sgma  =  {  ’  6 ’  ,  ’ 100 ’ } ; 

sgmaval  =  [6  100] ; 

base  =  ’  .  . / ’ ; 

workdir  =  strcat (base ,  ’tmp/ ’ ) ; 
stordir  =  strcat (base ,  *  analysis /’) ; 
ind  =  1 : length ( snrval ) -1 ; 

X  make  some  storage  arrays 

11  =  length (SNR)  ; 

12  =  length ( frms ) ; 

13  =  length ( sgma ) ; 

raw_mse  =  zeros ( 11 , 12 , 13 )  ; 
adj_mse  =  zeros (11 , 12 , 13)  ; 
scl_fact  =  zeros ( 11 , 12 , 13 )  ; 

raw_mse_s igma  =  zeros ( 11 , 12 , 13 ) ; 
ad j _mse_s igma  =  zeros ( 11 , 12 , 13 ) ; 
slc_f act _sigma  =  zeros ( 11 , 12 , 13 ) ; 

raw_mse_stderr  =  zeros ( 11 , 12 , 13 )  ; 
adj _mse_stderr  =  zeros ( 11 , 12 , 13 ) ; 
scl_f act _stderr  =  zeros (11 , 12 , 13)  ; 


name  for  results 
,  ’20’};  X  snr  values 

;  °/0  the  number  of  frames 

°/0  detector  variances 

X  base  dir 

°/0  working  directory 

°/0  directory  to  store  results  in 


mean_time  =  zer os ( 11 , 12 , 13 ) ; 
mean.t ime.sigma  =  zeros (11 , 12 , 13) ; 
mean_t ime_stderr  =  zeros ( 11 , 12 , 13 ) ; 

X  define  the  awk  command  to  look  for  the  run  time  int  the  iterate.dat  file 
cmd  =  [’echo  ‘ awk  ’, char  (39)  ,’ /Total  User  time/{print  $4}’,... 
char (39),’  iterate.dat*’]; 

X  open  the  file  to  store  the  results  in  and  write  colum  titles 
resfile  =  f  open  (  strcat  (  stordir  ,  fname  ),’  w  ’)  ; 
fprintf (resfile  ,  .  .  . 

’File ,Frms ,SNR, SGMA , Modes ,Div ,Dr0 , Unproc_MSE ,Scl_Cnst ,Adj_MSE ,Time\n’) ; 

°/0  change  to  the  work  directory 
cd (workdir ) ; 

X  cycle  through  the  frame  numbers  and  SNR’s 
for  c0=l : length ( SNR) 
for  cl =1 : length ( frms ) 
for  c4=l : length ( sgma) 
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srcdir  =  char ( strcat (base , SNR ( cO)  ,  ’ _snr / ’  , f rms ( cl )  ,  * f rm/ ’  ,  .  .  . 

sgma ( c4 )  ,  ’ sgma/ ’  ) )  ;  X  src  dir 
fls  =  dir  (  strcat  (  srcdir  tgz  ’))  ;  X  file,  listing 

fls  =  {fls. name};  °/0  strip  out  names 

nruns  =  length(fls); 
if  nruns  >0 


X  make  a  temp  storage  array  for  the  mse  ’  s 
Xunpro c_ms e_tmp  =  zeros (1  ,  nruns  ( fls) ) ; 
ad j _mse_tmp  =  zeros ( 1 , nruns ) ; 
raw_mse_tmp  =  zeros ( 1 , nruns ) ; 
scl_f act _tmp  =  zeros ( 1 , nruns ) ; 
time_tmp  =  zeros ( 1 , nruns ) ; 


for  c2=l : nruns 

file  =  char ( fls ( c2 )) ; 
fprintf (strcat (file , ’ \n ’ ) ) ; 
copyfile (strcat (srcdir ,file) , ’ . ’  ) ; 
unix (strcat ( ’tar  -xzf  ’,file)); 
delete  (file)  ; 


X  copy  dta  to  wrking  dir 
X  untar  the  data 
X  delete  the  data  file 


X  now  read  in  the  data  and  compute  the  desired  metrics  ..  . 


X  open  the  files  for  reading 
fll  =  f open (’ results . dat ’ ,’ r ’  , ’ ieee -le  ’)  ; 
fl2  =  f  open  (’  truth  .  dat  ’,  ’  r  ’,’  ieee  -  le  ’ )  ; 
fl3  =  f open ( ’ dat a . dat ’  ,  ’  r  ’  ,  ’ ieee - le ’ ) ; 


X  read  in  the  truth  data 


m  =  f read  ( f 12  , 

i , 

’ double  ’  ) 

f  =  f read  ( f 12  , 

i , 

’ double  ’  ) 

k  =  f read  (f 12  , 

i , 

’ double  ’  ) 

DrO  =  fread(fl2,  1,  ’double’); 
div  =  fread(fl2,  1,  ’double’); 

trth  =  reshape ( f read ( f 12 ,  m*m,  ’ double ’), m , m) ; 
f  close ( f 12 )  ; 


X  read  in  the  results 
m  =  fread(fll,  1,  ’double’); 
f  =  fread(fll,  1,  ’double’); 
k  =  fread(fll,  1,  ’double’); 
DrO  =  fread(fll,  1,  ’double’); 
div  =  f read  (fll  , 1 ,’ double ’)  ; 
obj  =  reshape ( f read  (fll  ,  m*m, 
f  close ( f 11 )  ; 


X  frame  dimension 
X  number  of  frames 
X  number  of  aberrations 
X  D/rO 
X  diversity 

’  double ’),  m  ,  m)  ;  X  resulting  obj 


)  ; 


X  read  in  the  unprocessed  data 
m  =  fread(fl3,  1,  ’double’); 
f  =  f read ( f 13 ,  1,  ’double’); 

k  =  fread(fl3,  1,  ’double’); 
DrO  =  fread(fl3,  1,  ’double 
div  =  f read ( f 13 , 1 ,’ double  ’  ) 
tmpdata  =  reshape (f read (f 13 
f  close ( f 13 )  ; 
focal  =  zeros (m) ; 
for  f rmcntr =1 : f 

focal  =  focal  +  tmpdata(: 
end 

focal  =  f  ocal . /f ; 
clear  tmpdata; 


X  frame  dimension 
X  number  of  frames 
X  number  of  aberrations 


X  D/rO 
X  divers ity 

2*m*m*f , ’double ’ 


, 2*  f  rmcntr - 1 )  : 


,  2* f )  : 


X  data 


X  compute  the  raw 
unproc_mse_tmp ( c2 ) 
r aw_mse_tmp ( c2 ) 
scl_f act_tmp ( c2)  = 


MSE ,  scale  factor,  and  adjusted  MSE 
=  mean (mean ( (trth-f ocal ).  ~2) )  ;  X  unprocessed 
=  mean (mean (( trth  -  obj).  "2));  X  raw  mse 
sum (sum (trth. *  ob j ) )  ./sum (sum (obj  .-2))  ; 


ms  e 
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obj  =  scl_f  act_tmp  (c2)  .  *obj  ;  °/0  adj  object 
ad  j  _mse_tmp  (  c2  )  =  mean  (mean  (  ( trth-obj  )  . '“2)  )  ;  %  adjusted  mse 

°/0  get  the  computation  time  for  this  run 
[status, t]  =  unix(cmd); 
if  ( length ( t ) >1) 

time_tmp(c2)  =  str2num(t); 
else 

time_tmp(c2)  =  NaN ; 
end 

°/0  write  the  results  to  the  output  file 

fprintf(resfile  ,strcat(file  ,  ’  ,#/«g,  ’  , char  (SNR  (cO))  ,  3  ,  ’  ,  .  .  . 

char  (  sgma  (  c4)  )  ,  ’  ,°/,g  ,°/,g  ,°/,g  ,7,g  ,  °/,g  ,°/,g  ,°/og\n  5 )  ,  .  .  . 
f,k,div,DrO,  unproc_mse_tmp ( c2 )  ,  s c l_f act _ tmp ( c2 )  ,  ... 

ad j _mse_tmp ( c2 )  , t ime_tmp  (  c2 ) )  ; 

°/0  clean  up  tmp  directory 
unix ( 3 rm  * . dat 3 ) ; 
end 

°/0  compute  the  statistics 

raw_mse (cO , cl , c4)  =  mean ( r aw_mse_tmp )  ; 

raw_mse_sigma (cO , cl , c4)  =  std ( raw_mse_tmp )  ; 

raw_mse_stderr ( cO , cl , c4)  =  raw_mse_sigma (cO , cl , c4) /sqrt (nruns ) ; 

adj _mse (cO , cl , c4)  =  mean ( ad j _mse_tmp )  ; 
adj _mse_sigma (cO , cl , c4)  =  std ( ad j _mse_tmp )  ; 

ad j _mse_stderr ( cO , cl , c4)  =  adj _mse_sigma (cO , cl , c4) /sqrt (nruns ) ; 

s cl_f act ( cO , cl , c4 )  =  mean ( scl_f act_tmp)  ; 
scl_f act.sigma (cO , cl , c4)  =  std (scl_f act_tmp)  ; 

scl_fact_stderr(c0,cl,c4)  =  scl_f act _ sigma (cO , cl , c4) /sqrt (nruns ) ; 

timeind  =  f ind ( i snan ( t ime_tmp ) ==0 ) ; 

mean_time (cO , cl , c4)  =  mean ( t ime _tmp ( t ime ind )) ; 

mean_t ime_s igma (cO , cl , c4)  =  std ( t ime_tmp ( t ime ind ) ) ; 

mean_time_stderr(c0,cl,c4)  =  mean_t ime _ sigma ( cO , cl , c4) / sqrt (nruns ) ; 
else 

raw_mse (cO , cl , c4)  =  NaN; 
raw_mse_sigma (cO , cl , c4)  =  0; 
raw_mse_stderr ( cO , cl , c4 )  =  0; 

adj _mse ( cO , cl , c4)  =  NaN; 
adj _mse_sigma (cO , cl , c4)  =  0; 
adj _mse_stderr (cO , cl , c4)  =  0; 

s cl_f act ( cO , cl , c4 )  =  NaN; 
scl_f act.sigma (cO , cl , c4)  =  0; 
slc_f act_stderr (cO , cl , c4)  =  0; 

mean.time (cO , cl , c4)  =  NaN; 
mean_t ime_s igma ( cO , cl , c4)  =  0; 
mean_t ime_stderr ( cO , cl , c4)  =  0; 
end 
end 
end 
end 

°/0  close  the  files  . 

f  close  (  3  all  3 )  ;  °/0  just  in  case... 

°/0  save  the  results  for  later  playing 
save  snr_f rm_r educed.dat a 

°/0  return  to  the  analysis  directory 
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cd ( st ordir ) 


°/0  plot  some  of  the  reduced  results 

set(0,  ’DefaultAxesLineStyleOrder’  ’ , ’-o’  , ’ -x ’ , ’  -+  ’  , ’ -d ’ }) 

linestyles=[’b-~’  ;  ’g-*  ’  ;  ’r-o  ’  ;  ’k-p  ’  ;  ’  c-d  ’  ;  ’m-+  ’  ]  ; 

°/0  expand  the  snrval  matrix 

snrmat  =  r epmat ( snrval  ’, 1 , length ( f rmval ))  ; 

°/0  make  on  plot  with  all  the  mean  MSE  ’  s  v.s.  SNR  and  frames 
for  c33=l : length ( sgma) 
hi  =  figure ; 
hold  on 

s  =  s ize ( snrmat )  ; 
for  c0=l:s(2) 

errorbar (snrmat (ind , cO)  , adj_mse ( ind , cO , c33 )  ,  .  .  . 

adj_mse_stderr(ind,c0,c33) ,  linestyles(cO, :)) 

°/0plot  ( snrmat  (ind  ,  cO )  ,  adj_mse  (ind  ,  cO  ,  c33 )  ,  l  inestyl  es  (cO  ,  : )  )  ; 
end 

hold  off 

xlabel ( ’ SNR  ’  ,  ’FontSize  ’  ,14)  ; 
y label (’MSE’  ,  ’FontSize ’  ,14)  ; 

title([’MSE  v.s.  Detection  SNR,  \ sigma_d= ’, char ( sgma ( c33) )],’ FontSize ’,  14) 
a  =  axis ; 

a(2)  =  max ( snrval ( ind) ) +1 ; 
a(l)  =  min ( snrval ( ind) ) -1 ; 
axis (a) 

grid  on; 

legend ( [ char (frms  (1) )  ,  ’  frms  5]  ,  [char (frms  (2) )  ,  ’  frms  ’]  ,  .  .  . 

[char (frms (3) )  ,  ’  frms’]  ,  [char (frms (4) )  ,  ’  frms  ’]  ,  .  .  . 

[char (frms(5)) , ’  frms’] , ’Location’ , ’NorthEast’) ; 

if  prnt 

prntfname  =  strcat(char(sgma(c33)) , ’ sgma_snr_frms_mse_dynamic.eps’) ; 
print (hi , ’-depsc ’ , prntfname ) ; 
end 


°/0  make  on  plot  with  all  the  mean  times  v.s.  SNR  and  frames 
h2  =  figure ; 
hold  on 

s  =  s ize ( snrmat )  ; 
for  c0=l:s(2) 

erro rb ar (snrmat (ind, cO)  ,mean_time(ind,c0,c33)  ,  .  .  . 

adj_mse_stderr(ind,c0,c33) ,  linestyles(cO, :)) 

°/0  plot  (snrmat  (ind  ,  cO)  ,  mean_time  (ind  ,  cO ,  c33 )  ,  lines  tyl  es  (cO  ,  : )  )  ; 
end 

hold  off 

xlabel(’SNR’  ,  ’FontSize’  ,14)  ; 

ylabel ( ’ Comput at  ion  Time  (  s  )  ’  ,  ’  Font S ize  ’  ,  14)  ; 

t itle (  [  ’ Time  v.s.  SNR,  \ sigma_d= ’, char ( sgma ( c33 ))],’ FontSize ’,  14) 
a  =  axis ; 

a(2)  =  max ( snrval ( ind) ) +1 ; 
a(l)  =  min ( snrval ( ind) ) -1 ; 
axis (a) 
grid  on; 

legend ( [ char (frms  (1) )  ,  ’  frms  ’]  ,  [char (frms  (2) )  ,  ’  frms’],... 

[char (frms(3))  ,  ’  frms’]  ,  [char (frms (4) )  ,  ’  frms’]  ,  .  .  . 

[char (frms (5) ) , ’  frms’] , ’Location’ , ’ne ’) 
if  prnt  &&  ~subfgs 

prntfname  =  strcat (char(sgma(c33)) , ’ sgma_snr_frms_time_dynamic . eps ’) ; 
print (h2 , ’ -depsc ’ , prntfname ) ; 
end 
end 
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°/0  put  out  a  plot  with  the  computation  time  for  the  noiseless  case  v.s.  frms 
h3  =  figure 

266  err orb ar (f rmval  , mean.time (end-1 ,  :  ,1)  ,mean_time_stderr (end-1 ,  :  ,  1) ) 
xlabel( ’Frames ’ , ’Font Size ’ ,14) 
ylabel(’Time  ( s )  ’  ,  ’ Font S ize ’  ,  14) 

title(’Mean  Computation  Time  vs.  Number  of  Frames ’,’ FontSize ’,  14) 
grid  on 
271  if  prnt 

prnt f name  =  str cat ( char ( SNR ( end -1 ) )  ,  ’ snr _f rms_t ime_l ineplot  _dynami c . eps  ’  ) 
print  (h.3  ,  ’  -depsc  ’  ,  prnt f  name  ) 
end 
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Far-Field  Approximation,  25 
Fraunhoffer  Diffraction,  25 
Fresnel  Diffraction,  25,  35 
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Simulation,  69 
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data.  Both  an  EM-algorithm  and  a  lensless- APDI  approach  are  presented  for  generating  imagery  directly  from  PP 
polarization  measurements.  However,  both  approaches  are  currently  impractical.  Suggestions  for  improvement  are 
offered.  Finally,  the  APDI  algorithm  is  modified  to  use  PP  polarization  data  in  place  of  PP  intensities.  An  initial 
statistical  model  is  offered,  and  suggestions  for  performance  improvement  are  presented. 
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